QUANTUM 
FIELD THE 


Claude Itzykson and 
Jean-Bernard Zuber 


DOVER BOOKS ON PHYSICS 


METHODS OF QUANTUM FIELD THEORY IN STATISTICAL Puysics, A.A. Abrikosov 
et al. (0-486-63228-8) 

DYNAMIC LIGHT SCATTERING: WITH APPLICATIONS TO CHEMISTRY, BIOLOGY, AND 
Puysics, Bruce J. Berne and Robert Pecora. (0-48641155-9) 

Quantum THEORY, David Bohm. (0-486-65969-0) 

Atomic Puysics (8TH EDITION), Max Born. (0-486-65984-4) 

EINSTEIN’S THEORY OF RELATIVITY, Max Born. (0-486-60769-0) 

INTRODUCTION TO HAMILTONIAN Optics, H.A. Buchdahl. (0-486-67597-1) 

THE METAPHYSICAL FOUNDATIONS OF MODERN SCIENCE, E. A. Burtt. (0-486-42551-7) 

MATHEMATICS OF CLASSICAL AND QUANTUM Physics, Frederick W. Byron, Jr. and 
Robert W. Fuller. (0-486-67164-X) 

PRIMER OF QUANTUM MEcHANIcs, Marvin Chester. (0-486-42878-8) 

THE THEORY OF THE ELECTROMAGNETIC FIELD, David M. Cook. (0-486-42567-3) 

MEcHANIcs, J.P. Den Hartog. (0-486-60754-2) 

MATHEMATICAL METHODS IN PHYSICS AND ENGINEERING, John W. Dettman. 
(0-486-65649-7) 

INVESTIGATIONS ON THE THEORY OF THE BROWNIAN MOVEMENT, Albert Einstein. 
(0-486-60304-0) =~ 

THE PRINCIPLE OF RELATIVITY, Albert Einstein, et al. (0-486-60081-5) 

THE Puysics OF WAVES, William C. Elmore and Mark A. Heald. (0-486-64926-1) 

THERMODYNAMICS, Enrico Fermi. (0-486-60361-X) 

QUANTUM THEORY OF MANY-PARTICLE Systems, Alexander L. Fetter and John 
Dirk Walecka. (0-486-42827-3) 

INTRODUCTION TO MODERN Optics, Grant R. Fowles. (0-486-65957-7) 

DIALOGUES CONCERNING TWO NEW SCIENCES, Galileo Galilei. (0-486-60099-8) 


Group THEORY AND ITS APPLICATION TO PHYSICAL PROBLEMS, Morton 
Hamermesh. (0-486-66181-4) 

ELECTRONIC STRUCTURE AND THE PROPERTIES OF SOLIDS: THE PHYSICS OF THE 
CHEMICAL Bonb, Walter A. Harrison. (0-486-6602 1-4) 

Soup Stare THEORY, Walter A, Harrison, (0486-63948-7) 

PHYSICAL PRINCIPLES OF THE QUANTUM THEORY, Werner Heisenberg. 
(0-486-601 13-7) 

ATOMIC SPECTRA AND ATOMIC STRUCTURE, Gerhard Herzberg. (0-486-601 15-3) 

AN INTRODUCTION TO STATISTICAL THERMODYNAMICS, Terrell L. Hill. 
(0-486-65242-4) 

EQUILIBRIUM STATISTICAL MECHANICS, E. Atlee Jackson. (0-486-41185-0) 

OPTICS AND OPTICAL INSTRUMENTS: AN INTRODUCTION, B.K. Johnson. 
(0-486-60642-2) 

THEORETICAL SOLID STATE Puysics, VOLs. I & II, William Jones and Norman H. 
March. (0-486-65015-4, 0-486-65016-2) 

THEORETICAL Puysics, Georg Joos, with Ira M. Freeman. (0-486-65227-0) 

THE VARIATIONAL PRINCIPLES OF MECHANICS, Cornelius Lanczos. (0-486-65067-7) 

What Is RELATIVITY? L. D. Landau and G. B. Rumer. (0-486-42806-0) 

ELEMENTS OF GASDYNAMICS, H. W. Liepmann and A. Roshko. (0-486-41963-0) 

THE KINETIC THEORY OF Gases, Leonard B. Loeb. (0-486-49572-8) 


(continued on back flap) 


~ 


Digitized by the Internet Archive 
in 2022 with funding from 
Kahle/Austin Foundation 


https://archive.org/details/quantumfieldtheo0000itzy i9a1 


QUANTUM 
FIELD THEORY 


QUANTUM 
FIELD THEORY 


Claude Itzykson and 
Jean-Bernard Zuber 


Commissariat a |’Energie Atomique 
Centre d’Etudes Nucléaires de Saclay 


DOVER PUBLICATIONS, INC. 
Mineola, New York 


Copyright 


Copyright © 1980 by McGraw-Hill, Inc. 
All rights reserved. 


Bibliographical Note 


This Dover edition, first published in 2005, is a republication of the work 
originally published by McGraw-Hill, Inc., New York, in 1980. A new preface 
to the Dover Edition and list of errata have been added. 


International Standard Book Number: 0-486-44568-2 


Manufactured in the United States of America 
Dover Publications, Inc., 31 East 2nd Street, Mineola, N.Y. 11501 


CONTENTS 


Preface to the Dover Edition xix 
Preface XXxi 
General References XXili 
Errata XXV 
1 CLASSICAL THEORY 1 

1-1 PRINCIPLE OF LEAST ACTION 1 
1-1-1 Classical Motion 1 


Lagrange and Hamilton’s equations. Poisson brackets. 
Relativistic point particle. Canonical transformations. 


1-1-2 Electromagnetic Field as an Infinite Dynamical System 7 
Maxwell’s equations. Gauge invariance. Electromagnetic action. 
1-1-3 Electromagnetic Interaction of a Point Particle 13 


Lorentz force law. Constant field. Gyromagnetic ratio. Thomas 
precession. Bargmann-Michel-Telegdi equation. Interaction 
with a'plane wave. 


1-2 SYMMETRIES AND CONSERVATION LAWS 19 


1-2-1 Fundamental Invariants 19 
Time translation and energy conservation. Momentum and 
angular momentum. 

1-2-2 Energy Momentum Tensor - 
Canonical tensor. Electromagnetic symmetric tensor. Closed 
systems. Generalized angular momentum. Hamiltonian form. 

1-2-3 Internal Symmetries 2 
Infinitesimal transformations. Noether’s theorem. Charges. 
Lie algebra with Poisson brackets. Minimal coupling. 


vi CONTENTS 


1-3 PROPAGATION AND RADIATION 


2-2 


2-3 


2-4 


1-3-1 Green Functions 
Superposition principle. Retarded and advanced Green 
functions. Feynman’s propagator. 

1-3-2 Radiation 
Lienard-Wiechert potentials. Larmor’s formula for the radiated 
power. Thomson scattering. Bremsstrahlung. Self-interaction. 
Lorentz-Dirac equation. Runaway solutions. Acausal effects. 


2 THE DIRAC EQUATION 


TOWARD A RELATIVISTIC WAVE EQUATION 


2-1-1 Quantum Mechanics and Relativity 

Klein-Gordon equation. Negative energies. Indefinite norm. 
2-1-2 The Dirac Equation 

y matrices. Conserved current with positive density. 
2-1-3 Relativistic Covariance 

Transformation laws. Mass and spin. Parity. Bilinear densities. 


PHYSICAL CONTENT 


2-2-1 Plane Wave Solutions and Projectors 
Positive and negative energy solutions. Projectors on states 
with definite momentum and polarization. 

2-2-2 Wave Packets 
Group velocity. Zitterbewegung and Klein’s paradox. 

2-2-3 Electromagnetic Coupling 
Minimal coupling. Nonrelativistic limit and Pauli equation. 
Solutions in a constant magnetic field and in an electromagnetic 
plane wave. 

2-2-4 Foldy-Wouthuysen Transformation 
Relativistic corrections. Spin-orbit coupling. Darwin term. 


HYDROGEN-LIKE ATOMS 


2-3-1 Nonrelativistic Versus Relativistic Spectrum 
Balmer’s spectrum. Predictions of the Klein-Gordon equation. 
2-3-2 Dirac Theory 
Fine structure splitting. Relativistic wave functions. Hyperfine 
splitting and radiative effects. 


HOLE THEORY AND CHARGE CONJUGATION 


2-4-1 Reinterpretation of Negative Energy Solutions 
Holes and positrons. Many-body theory. 


32 
32 


36 


45 
45 
45 


48 


51 


55 
55 


60 


64 


69 


ae 
72 


74 


84 
84 


3-1 


CONTENTS vii 


2-4-2 Charge Conjugation 
Charge conjugation operator and invariance of the Dirac 
equation. 

2-4-3 Zero-Mass Particles 
Chirality. Two-component theory. CP invariance. 


DIRAC PROPAGATOR 


2-5-1 Free Propagator 
Retarded and Feynman propagator from the hole theory point 
of view. 
2-5-2 Propagation in an Arbitrary External Electromagnetic Field 
Perturbation expansion of the propagators in an external field. 
2-5-3 Application to the Coulomb Scattering 
Classical Rutherford scattering. Mott’s formula. 
2-5-4 Fock-Schwinger Proper Time Method 
Propagation in a constant uniform field or a plane wave field. 


3. QUANTIZATION—FREE FIELDS 


CANONICAL QUANTIZATION 


3-1-1 General Formulation 
Conjugate momenta and quantization. One-dimensional 


crystal. Phonons. Normal ordering. Bose symmetry. Fock space. 


3-1-2 Scalar Field 
Invariant measure. Relativistic invariance. Field commutator. 
States with fixed number of particles. Coherent states. Field 
distribution. 

3-1-3 Charged Scalar Field 
U(1) invariance. Conserved charge. Particles and antiparticles. 
Charge conjugation. 

3-1-4 Time-Ordered Product 
Feynman’s propagator recovered. 

3-1-5 Thermodynamic Equilibrium 
Bose-Einstein distribution. Equation of state. Propagators at 
finite temperature. 


QUANTIZED RADIATION FIELD 


3-2-1 Indefinite Metric 
Gupta-Bleuler quantization. Transverse photons. 
3-2-2 Propagator 
Feynman gauge propagator. Generalization to arbitrary A. 


85 


87 


89 
89 


93 


94 


100 


105 


105 
107 


114 


120 


123 


124 


127 
127 


133 


Vili CONTENTS 


3-2-3 Massive Vector Field 
Proca and Stueckelberg equations. Zero-mass limit. Upper 
limit on the photon mass using the constant field method. 
3-2-4 Vacuum Fluctuations 
Casimir effect. Van der Waals forces. 


3-3 DIRAC FIELD AND EXCLUSION PRINCIPLE 


3-3-1 Anticommutators 
Dirac field quantization and positivity of the energy. 
3-3-2 Fock Space for Fermions 
Charge and spin. Antisymmetric states and Fermi-Dirac 
statistics. 
3-3-3 Relation between Spin and Statistics— Propagator 
The connection between spin and statistics as implied by 
local quantum theory. 


3-4 DISCRETE SYMMETRIES 


3-4-1 Parity a 
3-4-2 Charge Conjugatio 
3-4-3 Time Reversal 
3-4-4 Summary 


Transformation properties of quadratic forms. PCT theorem. 


Form factors. Fierz reshuffling theorem. 


4 INTERACTION WITH AN EXTERNAL 
FIELD 


4-1 QUANTIZED ELECTROMAGNETIC FIELD INTERACTING 
WITH A CLASSICAL SOURCE 


4-1-1 Emission Probabilities 
S matrix. Poisson distribution. 
4-1-2 Emitted Energy and the Infrared Catastrophe 
Finite energy versus infinite photon number. 
4-1-3 Induced Absorption and Emission 
Stimulated emission. 
4-1-4 § Matrix and Evolution Operator 
Interaction representation. General form of the S matrix. 


4-2 WICK’S THEOREM 


4-2-1 Bose Fields 
Wick’s identities from a generating functional. 


134 


138 


142 
143 


146 


149 


oil 


15d 
1S2 
154 
156 


163 


163 
163 


170 


174 


176 


180 
180 


4-3 


CONTENTS ix 


4-2-2 Fermi Fields 
Grassmann algebra formalism. 
4-2-3 General Case 


QUANTIZED DIRAC FIELD INTERACTING WITH A 
CLASSICAL POTENTIAL 


4-3-1 General Formalism 
Fredholm determinant. Gauge invariance and unitarity. 
4-3-2 Emission Rate to Lowest Order 
4-3-3 Pair Creation in a Constant Uniform Electric Field 
Nonperturbative tunneling effect. 
4-3-4 The Euler-Heisenberg Effective Lagrangian 


Effective action for a slowly varying field. Field renormalization. 


Fourth-order expression. 


5 ELEMENTARY PROCESSES 


S MATRIX AND ASYMPTOTIC THEORY 


5-1-1 Cross Sections 
In- and out-states. S and T matrices. Invariant cross sections. 
5-1-2 Asymptotic Theory 
Kallen-Lehmann representation for the vacuum expectation 
value of field commutators. 
5-1-3 Reduction Formulas 
Lehmann-Symanzik-Zimmermann formalism. 
5-1-4 Generating Functional 
5-1-5 Connected Parts 
5-1-6 Fermions 
5-1-7 Photons 
Current conservation and covariant time-ordered product. 


APPLICATIONS 


5-2-1 Compton Effect 
The Klein-Nishina relativistic formula. 
5-2-2 Pair Annihilation 
Dirac formula. 
5-2-3 Positronium Lifetime 
Singlet and triplet lifetimes to lowest order. 
5-2-4 Bremsstrahlung 
The Bethe-Heitler cross section for photoemission in the 
nuclear Coulomb field. 


182 


185 


185 
185 


191 
193 


125 


198 


198 
199 


202 


205 
208 
2h 


213 
oA) 


224 
224 


230 


233 


238 


X CONTENTS 


5-3 UNITARITY AND CAUSALITY 


6-2 


5-3-1 Unitarity and Partial Wave Decomposition 
Optical theorem. Jacob and Wick partial wave expansion. 
5-3-2 Causality and Analyticity 
Retarded commutators. Support properties in configuration 
and momentum space. Crossing symmetry. 
5-3-3 The Jost-Lehmann-Dyson Representation 
5-3-4 Forward Dispersion Relations 
General derivation. Application to pion-nucleon scattering. 
Determination of the effective pion-nucleon coupling. 
5-3-5 Momentum Transfer Analyticity 
Small and large Lehmann ellipses. Froissart bound. 
Pomeranchuk theorem. 


6 PERTURBATION THEORY 


INTERACTION REPRESENTATION AND 
FEYNMAN RULES 


6-1-1 Self-Interacting Scalar Field 
Perturbative series in the interaction picture. Feynman 
rules in configuration and momentum space. Symmetry factors. 
6-1-2 Feynman Rules for Spinor Electrodynamics 
Furry’s theorem. 
6-1-3 Electron-Electron and Electron-Positron Scattering 
M@ller’s and Bhabha’s formulas. 
6-1-4 Scalar Electrodynamics 
Derivation of covariant perturbation rules and computation 
of elementary processes in scalar electrodynamics. 


DIAGRAMMATICS 


6-2-1 Loopwise Expansion 
Counting the number of loops in terms of powers of Planck’s 
constant. 

6-2-2 Truncated and Proper Diagrams 


Topological and algebraic definitions. Legendre transformation. 


6-2-3 Parametric Representation 
General expression of Feynman integrals in terms of 
parameters. 

6-2-4 Euclidean Green Functions 
The Wick rotation performed term by term in the perturbation 
expansion. 


241 
241 


245 


249 
250 


296 


261 


261 
262 


271 


276 


282 


287 
287 


PRE. 


294 


299 


6-3 


7-1 


7-2 


ied 


CONTENTS xi 


ANALYTICITY PROPERTIES 


6-3-1 Landau Equations 

6-3-2 Real Singularities 
Normal and anomalous thresholds. 

6-3-3 Real Singularities of Simple Diagrams 
One-loop bubble, vertex, and box diagrams. The Mandelstam 
representation. 

6-3-4 Physical-Region Singularities. Cutkosky Rules 


7 RADIATIVE CORRECTIONS 


ONE-LOOP RENORMALIZATION 


7-1-1 Vacuum Polarization 
Vacuum polarization tensor. Gauge-invariant regularization. 
Renormalized charge. Effect on the Lamb shift. 

7-1-2 Electron Propagator 
Gauge-dependence and infrared problems of the electron 
self-energy. 

7-1-3 Vertex Function 
Ward identity. On-shell vertex function. Form factors. 

7-1-4 Summary 
Counterterms and bare lagrangian. Power counting. 


RADIATIVE CORRECTIONS TO THE INTERACTION 
WITH AN EXTERNAL FIELD 


7-2-1 Effective Interaction and Anomalous Magnetic Moment 
7-2-2 Radiative Corrections to Coulomb Scattering 
The «? corrections to Mott’s formula. 
7-2-3 Soft Bremsstrahlung 
7-2-4 Finite Inclusive Cross Section 


NEW EFFECTS 


7-3-1 Photon-Photon Scattering 
Low- and high-energy behavior. Explicit expression in the 
zero frequency limit. 
7-3-2 Lamb Shift 
Radiative corrections to bound-states energy levels and 
computation of the Lamb shift to leading «(Za)* order. 
7-3-3 Van der Waals Forces at Large Distances 
Computation of the coefficient of the r~ 7 law in terms 
of static susceptibilities. 


301 
302 


305 


308 


313 


318 


358 


365 


Xii_ CONTENTS 


8-1 


8-2 


8-3 


8-4 


8 RENORMALIZATION 


REGULARIZATION AND POWER COUNTING 


8-1-1 Introduction 

8-1-2 Regularization 
A review of the various procedures. 

8-1-3 Power Counting 
Superficial degree of divergence. Renormalizable and 
nonrenormalizable interactions. 

8-1-4 Convergence Theorem 


RENORMALIZATION 


8-2-1 Normalization Conditions and Structure of the Counterterms 
The importance of the normalization conditions and their 
arbitrariness. Multiplicative character of the renormalization 
operation. 

8-2-2 Bogoliubov’s Recursion Formula 
The R operation. 

8-2-3 Zimmermann’s Explicit Solution 

8-2-4 Renormalization in Parametric Space 
Form of Zimmermann’s operation in parametric space. 
Indications on the Bogoliubov-Parasiuk-Hepp convergence 
theorem. 

8-2-5 Finite Renormalizations 

8-2-6 Composite Operators 
Multiplicative renormalization in matrix form. 


ZERO-MASS LIMIT, ASYMPTOTIC BEHAVIOR, 
AND WEINBERG’S THEOREM 


8-3-1 Massless Theories 


Infrared power counting. Kinoshita-Lee-Nauenberg theorem. 


8-3-2 Ultraviolet Behavior and Weinberg’s Theorem 


THE CASE OF QUANTUM ELECTRODYNAMICS 


8-4-1 Formal Derivation of the Ward-Takahashi Identities 

8-4-2 Pauli-Villars Regularization to All Orders 

8-4-3 Renormalization 

8-4-4 Two-Loop Vacuum Polarization 
An illustrative computation carried out in massless quantum 
electrodynamics, using dimensional regularization. . 


372 


372 


392 
374 


372 


382 


385 
385 


389 


302 
394 


397 
oe) 


402 
402 


405 


407 


407 
411 
413 
415 


CONTENTS xiii 


9 FUNCTIONAL METHODS 


9-1 PATH INTEGRALS 


9-1-1 The Role of the Classical Action in Quantum Mechanics 
Matrix element of the time evolution operator. Path integral 
over classical trajectories. Harmonic oscillator and evolution 
under the action of an external force. Differential formulation. 
Scattering. Eikonal approximation. 

9-1-2 Trajectories in the Bargmann-Fock Space 
Hilbert space of analytic functions. Analytic kernel of an 
operator. Normal form. Application to path integrals. 

9-1-3 Fermion Systems 
Exterior algebra. Derivatives and integrals. Analytic functions. 
Transition amplitudes. 


9-2 RELATIVISTIC FORMULATION 


9-2-1 S Matrix and Green Functions in Terms of Path Integrals 
The generating functional as a path integral. Equations of 
motion. 

9-2-2 Effective Action and Steepest-Descent Method 
Perturbation theory as the semiclassical expansion around zero 
classical field. Effective potential. Explicit computation up to 
order h?. 


9-3 CONSTRAINED SYSTEMS 


9-3-1 General Discussion 
Canonical structure of constrained systems. Auxiliary 
conditions. Quantization in terms of the original variables. 
9-3-2 The Electromagnetic Field as an Example 


9-4 LARGE ORDERS IN PERTURBATION THEORY 


9-4-1 Introduction 
Divergence of perturbation theory. Borel transformation. 
9-4-2 Anharmonic Oscillator 
Feynman-Kac formula. Euclidean action. Classical solutions 
of finite action. Fluctuations around a degenerate extremum. 
The Bender and Wu expression for large orders of the 
ground-state energy. 


425 


425 


425 


435 


439 


443 


443 


448 


456 


456 


461 


463 


463 


467 


Xiv CONTENTS 


10-1 


10-2 


10-3 


11-2 


10 INTEGRAL EQUATIONS AND BOUND-STATE 
PROBLEMS 


THE DYSON-SCHWINGER EQUATIONS 


10-1-1 Field Equations . 
Functional methods and integral equations. 
10-1-2 Renormalization 


RELATIVISTIC BOUND STATES 


10-2-1 Homogeneous Bethe-Salpeter Equation 
Derivation of the equation. Normalization of the solutions. 
10-2-2 The Wick Rotation 
Stability conditions and rotation to euclidean variables. 
10-2-3 Scalar Massless Exchange in the Ladder Approximation 
The Wick-Cutkosky model. Conformal transformations. 
One-dimensional reduction. Abnormal solutions. 


HYPERFINE SPLITTING IN POSITRONIUM 


10-3-1 General Setting 
Instantaneous approximation. Salpeter’s perturbation 
theory. 

10-3-2 Calculation to Order «° 
The Karplus and Klein expression for the singlet-triplet 
ground-state splitting. 


11 SYMMETRIES 


QUANTUM IMPLEMENTATION OF SYMMETRIES 


11-1-1 Statement of the Problem 
11-1-2 Behavior of the Ground State 
Coleman’s theorem. 


MASS SPECTRUM, MULTIPLETS, AND 
GOLDSTONE BOSONS 


11-2-1 The Octet Model of Gell-Mann and Ne’eman 
Isospin and unitary symmetry. Representations and weight 
diagrams. The mass formula of Gell-Mann and Okubo. 
11-2-2 Spontaneous Symmetry Breaking 


Goldstone’s theorem. Instabilities from quantum corrections. 


Lower critical dimension. 


475 


475 
476 


48] 


481 
482 


485 


487 


493 
495 


500 


509 


509 


510 
S12 


513 
513 


Bl) 


CONTENTS Xv 


11-3 CURRENT ALGEBRA 


11-4 


11-3-1 


11-3-2 


11-3-3 


Current Commutators 

Algebra of charges and currents. Schwinger terms. The 
sum rules of Cabibbo and Radicati, of Adler, and of Bjorken. 
Approximate Conservation of the Axial Current 

and Chiral Symmetry 

The pion as a Goldstone particle. Goldberger-Treiman 
relation and pion pole dominance. 

Low-Energy Theorems and Sum Rules 

Low-energy theorem for Compton scattering. Drell-Hearn 
sum rule. Pion-nucleon scattering lengths and the Adler- 
Weisberger relation. 


THE o MODEL 


11-4-1 


Description of the Model 
The lagrangian of Gell-Mann and Lévy. Explicit and 
spontaneous symmetry breaking. 


11-4-2 Renormalization 


Ward identities. Soft and hard breaking. Nonlinear 
o model. 


ANOMALIES 


11-5-1 


11-5-2 


11-5-3 


The x° > 2) Decay and Current Algebra 

The Veltman-Sutherland paradox. 

Axial Anomaly in the o Model 

Explicit calculation of the Adler-Bell-Jackiw anomaly 
in a model. 

General Properties 


12 NONABELIAN GAUGE FIELDS 


CLASSICAL THEORY 
12-1-1 The Gauge Field 4, and the Tensor F,,, 


12-1-2 
12-1-3 


12-1-4 


Geometrical interpretation. 

Classical Dynamics 

Equations of motion and energy momentum tensor. 
Euclidean Solutions to the Classical Equations of Motion 
Degeneracy of the ground state and instantons. 

Gauge Invariance and Constraints 


526 
526 


534 


536 


540 
541 


543 


549 
550 


531 


556 


562 


562 
563 


567 


569 


573 


xvi CONTENTS 


12-2 QUANTIZATION OF GAUGE FIELDS 


12-2-1 Constrained Quantization 
12-2-2 Integration over the Gauge Group 

The Faddeev-Popov determinant and ghosts. 
12-2-3 Feynman Rules . 

Covariant and axial gauges. 


12-3 THE EFFECTIVE ACTION AT THE ONE-LOOP ORDER 


12-3-1 General Form 
12-3-2 Two-Point Function 
12-3-3 Other Functions 
12-3-4 One-Loop Renormalization 
Universality of coupling constant renormalization. 


12-4 RENORMALIZATION 


12-4-1 Slavynov-Taylor Identities 
Becchi-Rouet-Stora transformation. 

12-4-2 Identities for Proper Functions 

12-4-3 Recursive Construction of the Counterterms 

12-4-4 Gauge Dependence of Green Functions 

12-4-5 Anomalies 


12-5 MASSIVE GAUGE FIELDS 


12-5-1 Historical Background 
Massive vector fields as intermediate bosons of weak or 
strong interactions. 

12-5-2 Massive Gauge Theory 
Nonrenormalizability of massive vector theories. 

12-5-3 Spontaneous Symmetry Breaking 
Group theory of the Higgs phenomenon. The’t Hooft- 
Polyakov monopole solutions. 

12-5-4 Renormalization of Spontaneously Broken Gauge 
Theories 

12-5-5 Gauge Independence and Unitarity of the § Matrix 
The ’t Hooft gauge. 


12-46 THE WEINBERG-SALAM MODEL 


12-6-1 The Model for Leptons 

12-6-2 Electron-Neutrino Cross Sections 

12-6-3 Higher-Order Corrections 

12-6-4 Incorporation of Hadrons 
Charm. Cabibbo coupling. 


CONTENTS XVii 


13 ASYMPTOTIC BEHAVIOR 632 
13-1 EFFECTIVE CHARGE IN ELECTRODYNAMICS 633 
13-1-1 The Gell-Mann and Low Function 633 
Renormalization group. The function wy to lowest orders. 
Landau ghost. 
13-1-2 The Callan-Symanzik Equation 637 
The f function in electrodynamics. 
13-2 BROKEN SCALE INVARIANCE 641 
13-2-1 Scale and Conformal Invariance 642 


Classical scale invariance implies invariance under the 
full conformal group. 


13-2-2 Modified Ward Identities 645 
Derivation of the Callan-Symanzik equation in the general 
case. 

13-2-3  Callan-Symanzik Coefficients to Lowest Order 651 


Computation in g* theory and nonabelian gauge fields. 


13-3 SCALE INVARIANCE RECOVERED 654 

13-3-1 Coupling Constant Flow 654 
Ultraviolet and infrared fixed points. Anomalous dimensions. 

13-3-2 Asymptotic Freedom 657 


Necessity of nonabelian gauge fields for asymptotic freedom. 
Leading logarithmic corrections. 

13-3-3 Mass Corrections 659 
Wilson’s criterion. 


13-4 DEEP INELASTIC LEPTON-HADRON SCATTERING 
AND ELECTRON-POSITRON ANNIHILATION INTO 


HADRONS ~ 

13-4-1 Electroproduction ao 
Scaling behavior of the structure functions. 

13-4-2 Light-Cone Dynamics ee 
Parton model predictions. 

13-4-3 Electron-Positron Annihilation 668 


The ratio R = 65+¢-_, padrons/Tete- + utp» Corrections to Roo. 


13-5 OPERATOR PRODUCT EXPANSIONS 671 


13-5-1 Short-Distance Expansion 672 
Zimmermann’s derivation on an example. 


Xvili CONTENTS 


13-5-2 Dominant and Subdominant Operators, Operator Mixing, 
and Conservation Laws 

13-5-3 Light-Cone Expansion 
Wilson’s expansion and the moments of the structure 
functions. 


APPENDIX 


A-1 METRIC 
A-2 DIRAC MATRICES AND SPINORS 


A-3 NORMALIZATION OF STATES, S MATRIX, UNITARITY, 
AND CROSS SECTIONS 


A-4 FEYNMAN RULES 
INDEX 


678 
681 


691 


691 
692 


696 
698 
701 


PeerPAscne, TO Tk DOVER EDITION 


It is a great pleasure to see this book, first published a quarter of 
a century ago by McGraw-Hill, now being reprinted by Dover Publica- 
tions, Inc. Of course, in the meantime, other textbooks of quantum feld 
theory have appeared and the reader has a wide choice among several 
excellent and modern references. It is, however, my sincere hope and se- 
cret ambition that the present book will continue to be useful to many 
readers. 

The present edition is a mere reprint of the original McGraw-Hill 
edition. Aside from a list of errata, that the reader will find beginning 
on page xxv, we have not attempted to revise the text. Obviously, there 
are several sections which we would write differently today. Perhaps the 
most dramatic obsolescence in this book is the discussion of non-abelian 
anomalies in chapter XI, which has been completely outdated by the 
progress in its understanding made in the early eighties. The reader is 
invited to consult the literature to fill this gap. 

The new printing takes place in a special year, since 2005 marks the 
tenth anniversary of the death of Claude Itzykson in May 1995. It is with 
particular emotion that I devote this new edition to the memory of a dear 
friend and a great physicist, who initiated the lectures which led to this 
book and was the main driving force in the whole project. 


Saclay/Jussieu, August 2005 
Jean-Bernard Zuber 
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PREFACE 


Quantum field theory has remained throughout the years one of the most 
important tools in understanding the microscopic world. The recent years have 
seen a blossoming of developments and applications which go far beyond the 
original scope. 

Our attempt at presenting a pedagogical survey of this subject arose from 
lectures given in Orsay and Saclay, and first materialized in a set of notes written 
in French. As with any such endeavor it reflects to a large extent our prejudices 
and enthusiasms, even though we have tried to be as thorough as possible and 
to keep a balance between the formalism and practical examples of calculations. 

In its present version this book addresses itself both to students and researchers 
in field theory, particle physics, and related areas. It presupposes a general back- 
ground in quantum mechanics, electrodynamics, and relativity, and assumes some 
familiarity with classical calculus including group theory and complex analysis. 
To avoid one-sided views, we have respected the historical perspective and given 
equal weight to the operator formulation, propagator approach, and more 
synthetic path integral representation. Nevertheless, this is by no means a complete 
treatise. We list some of the main omissions. For lack of space and because we 
felt incompetent we do not treat axiomatic field theory and critical phenomena 
in statistical mechanics. Specific topics such as a detailed study of the Poincare 
group and its representations, higher spin fields, a thorough discussion of infrared 
problems, gravitational interactions, two-dimensional models, etc., could not be 
discussed here. 

The book is roughly divided into two parts. The first eight chapters cover 
the standard quantization of electrodynamics and culminate in the perturbative 


xxi 


XXii PREFACE 


renormalization. The remaining five chapters are devoted to functional methods, 
relativistic bound states, broken symmetries, nonabelian gauge fields, and asymp- 
totic behavior. 

Sections of the text in smaller type contain exercises, comments, and sug- 
gestions. It was thought inappropriate to have a separate set of problems. The 
latter often consist in completing derivations and extending or applying some 
methods, and are therefore more usefully related directly with the main discus- 
sion—the more so as one progresses toward the recent and richer aspects of 
the theory. 

To give some perspective, provide our sources, and suggest some further 
reading, each chapter closes with a section entitled “notes,” which collects a 
number of references. Needless to say, they are not meant to be complete in any 
way. We present our apologies in advance to all of those who will not find their 
work properly quoted. 

We owe, of course, a great deal to many of the excellent textbooks on the 
subject. Some overlap was unavoidable. This is also true of several review articles 
and lecture notes. 

We hope that the reader will not suffer too much from our insufficient mastery 
of the English language. In the process of translation we have undoubtedly lost 
some accuracy in style that only belongs to one who expresses himself in his 
mother tongue. 

The stimulating atmosphere of the Service de Physique Theorique in Saclay 
has been essential in developing our interest in field theory, and we have a great 
debt toward our friends and colleagues past and present. To R. Stora our teacher 
and example, to M. Bander, A. Carroll, G. Mahoux, G. Parisi, N. Salingaros, 
and V. Ganapathi who read the manuscript, and to the students and physicists 
who pointed out misprints, made comments and suggestions, and who cannot all 
be quoted here, we express our warmest thanks. We are also grateful to T. Kinoshita 
who was kind enough to provide us with a review of recent results in quantum 
electrodynamics. 

This enterprise would have been impossible without the help of our laboratory. 
We thank above all Brigitte Bunel, Danielle Lechaton, and Jeanine Rigou, who 
carried the burden of typing the manuscript, Madeleine Porneuf who supervised 
the project, and the Commissariat a |’Energie Atomique and its staff who 
supported it. 


Claude Itzykson Saclay 
Jean-Bernard Zuber March, 1978 
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CHAPTER 


ONE 
CLASSICAL THEORY 


In this chapter we survey the lagrangian formalism in classical mechanics and 
its extension to systems with infinitely many degrees of freedom, emphasizing 
symmetries and conservation laws in their local form. Using electromagnetism 
as an example, we introduce Green functions and propagators. Elementary 
radiation problems are presented, and we close the chapter by studying the 
inconsistencies of self-interaction. As classical field configurations play a growing 
role in modern developments we will return to this subject in later chapters. 


1-1 PRINCIPLE OF LEAST ACTION 


1-1-1 Classical Motion 


In classical mechanics the equations of motion follow from a principle of least 
action. If gq stands for a finite collection of configuration variables, q = 
‘01,42,---,4n}, With q their velocities at time t, the action is defined as 


_ \ raneGy 401 (1-1) 


The Lagrange function L depends on the positions, velocities, and sometimes also 
explicitly on time for open systems, ie., systems subjected to given external forces. 
The principle of least action states that among all trajectories q(t) which join qi 
at time ft; to q2 at time f2, the physical one yields a stationary value for the 
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action. This stationary value is a unique minimum if q;, t; and qo, tz are close 
enough. The action is therefore to be considered as a functional of all regular 
functions q(t) satisfying the boundary conditions q(t:) = qi, q(t2) = q2- If Q(t) Is 
the actual trajectory, a nearby one is written q(t) = Q(t) + dq(t). Expanding the 
action in powers of dq as 


1(q) = 1(Q) + [ag t 5p (OD5a00) + (1-2) 


we express the principle by setting 


)= (1-3) 
eae 
To compare this expression with the Euler-Lagrange equations we note that 
since 


g(t) = dqlt) Salt) = dalts) = 0 (1-4) 


the variation of (1-1) reads 


OL OL dee 
a= |" a Sh aa [a [2 oa = 2 oa 


An integration by parts of the last term, taking into account boundary conditions, 
yields the familiar form 


él OF _ ok od OL OL 9 (1-5) 
5q(t) dq(t) dt Og(0) 
which is to be interpreted as a vector equation in the case of several degrees of 
freedom. In the simplest cases L is the difference between a kinetic part quadratic 
in the velocities and a potential part. 

The equations are unchanged if we add to L a total derivative with respect 
to time, and the action is only modified by contributions depending on the 
boundary conditions. This observation shows that the Lagrange function is not an 
intrinsic object to determine the motion and leads to a more abstract formalism, 
developed in particular by E. Cartan. 

The hamiltonian formulation is obtained through the definition of conjugate 
momenta 


ra) 1-6 
Pi= 55 (a4 (1-6) 


Assume that we are able to invert this equation, 1e., to express the velocities 
in terms of the momenta and the positions. We shall see later what happens if 
the jacobian of this transformation vanishes. Hamilton’s function is then obtained 
by a Legendre transformation 


H(p, q) = pidi(p. 9) — L[, 4(p, 4) ] (1-7) 
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where summation over dummy indices is understood. By differentiation 


og; éL dL 04; (AL 
dH = qe |p, = — ; eS (ee 
E iu CDi (», al et 0q: 04; ee n)| agi 


We see, using the expression of conjugate momenta, that the Euler-Lagrange 
equations take the form 
. wer . ed 1-8 
4 = Bp, P= 0g; (1-8) 


More generally the variation of a function defined on phase space (the space of 
the 2N variables p, q) is 


ey OHO CH Of _ of : 
meer beh dain a TS (1-9) 


We have introduced the Poisson bracket notation 


(Ears ane (1-10) 


It follows from (1-9) that a function without an explicit dependence on time, 
and such that its Poisson bracket with H vanishes, is a constant of the motion. 

It is remarkable that Hamilton’s Eqs. (1-8) also follow from a principle of 
stationarity. Indeed, let us insert 


L(q, q) dt = p dq — H(p, q) dt 
in (1-1) so that the action reads 


tz 
=| pdq—Hat (1-11) 


and can be thought of as a functional of 2N independent functions q(t) and p(t). 
Assume again that q(t,) = q1, q(t2) = qz without any restriction on p(t;) or p(tz). 


We then have 
2 oH d OH 
él = Dis — 6q ——6 dt 
I (4 a) + (» dt" q ‘)| 


Integrating by parts the p(d/dt) dq term we obtain 


1, 0H _—il_~_s=«s« OH 
bp) 4 ap SCgt) Sq 


By setting the variation 5I = 0 we therefore recover Eqs. (1-8). Note that the 
action I can also be written 


tz 
I= pats — Pits ~ | qdp+Hat 
; nh 


Hence p’s and q’s play a similar role in the action. 
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Let us now compute the action J along the stationary trajectory. We assume 
(q2, t2) close enough to (q;,t,) so that the fatter is unique and use the notation 


I(qo, ta 341, tx) or even the shorter form I(2,1). 


It is by no means infrequent to find several trajectories joining two points in a given time. For an 
harmonic oscillator, for instance, infinitely many trajectories return to the origin after half a period. 


We can easily verify that 


61 (qa, ta; qi, t1) = (P2 G2 — Hz dt2) — (pi 6g: — Ai ty) (1-12) 
> o1(2, 1) gi 01 (2, 1) 
where po = ae He Ot, 


If the Lagrange function depends explicitly on some parameter a, we have, 
furthermore, 


t2 


d OL : 
de I(q2, toy; 1; ty) = | dt eu [a, Q(t, ax), O(t, a) | (1-13) 


ty 
where the derivative on the right-hand side is taken on the explicit dependence 
of L on «. As an application of this result, let us add to a given Lagrange function 
a term qF(t) which will contribute an external driving force F(t) to the equations 
of motion. The action will become a functional of F(t), and for t; <t<t, we 
obtain from (1-13) 
61(2, 1) 
OF(t) 


Q(t) (1-14) 


where Q(t) stands for the trajectory in the presence of F (we can also evaluate 
both sides for F = 0). This is not a priori the best method to solve the equations 
of motion, but it illustrates the method of generating functions, a procedure 
occurring frequently in the following. 


Consider, for instance, a particle of mass m constrained to move along a one-dimensional axis under 
the action of a given force F(t). Its Lagrange function is 


+2 
m 
L(q, 4) = = + qF(t) (1-15) 
The solution of the equation of motion can be written 


o() = 2 Ot galt =H) | "a Gy EO 


1) hs i m 


(1-16) 


where the Green function G(t,t’), symmetric in the interchange of t and t’ and vanishing at t= ft, 
and t = to, is a solution of 


ag 
ae G(t, ’) = d(¢ — 1’) (1-17) 


Here o(t — t') is Dirac’s point distribution, the derivative of the step function @(t — t’). We find 


1 


es (py ald 
mae: 


[2 — A(t — t1)0(t — £) + (2 — re — ,)O(' — D] (1-18) 
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The value of the action along the trajectory reads 


— 2 t ean a 
(qo, t23 41; ty) == = (dz = 41)" + | at qiltz — 1) + galt — ti) F(t) 
2 a t to —ty 
1 tz t: 
* am | a | ot Ogee) (1-19) 
2m Jr, i 


We verify that the functional derivative with respect to F yields the trajectory. If we set F = 0 we also 
recover 


d121)_ 2a : 


d12,1)_ om (2 agi 
0q2 tz — ty Ot m 2 


oy = —H(p2, 42) (1-20) 


1s 


As a second example let us recall how this formalism applies to a free relativistic particle. Let x 
and v be the position and velocity of the particle written gq and g up to now. The space-time interval 
reads 


As? = c?At? — Ax? = c?Ar? (1-21) 


where t is the proper time along the trajectory dt = (1 — v7/c?)'/? dt = y ‘dt and c is the velocity of 
light. In four-vector notations 


x" = (ct, x) i — Owes 
and 
As? = g,, Ax" Ax’ = Ax, Ax" 


with the metric tensor 


1 0 0 0 

pA 0 -!l 0 0 v= 5 

Juv =9 0 oe =I 0 Gu = 
0 0 0 -l 


used to lower or raise the Lorentz indices. The interval As? is invariant under Poincaré transformations, 
combining translations (a) and homogeneous Lorentz transformations (A): 


x» x/# = AMX” + at A’gA =g (1-22) 


A Lorentz transformation can be factorized as a product of an ordinary rotation and a boost along a 
direction n: 


x’=x +n[(cosha — 1)x-n— ctsinha] (1-23) 


ct’ = ct cosh a — n°x sinha 


where v = c tanh an is the velocity of the moving frame. The four-velocity 


dx" dt dx\ dt 
a |= (1-24) 
itl (< dt’ 4 dt ev) 


_— , : 2 2 
is a time-like vector of constant length c. Its derivative with respect to proper time d x’/dt” = du*/dt, 
orthogonal to u“, is therefore space-like. For a free particle the four-momentum satisfies 


E 
p* = mut = (=. Pp (c, Y) 


m 
c )- =e (1-25) 


2 
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Note that p = Ev/c?. Let us now find the corresponding Lagrange function using as principles (1) the 
relativistic invariance of the action and (2) the occurrence of only first derivatives of the position in 
L. The last requirement is, of course, borrowed from nonrelativistic mechanics and means that position 
and velocity should be sufficient to prescribe the motion. It may be relaxed under more general 
circumstances to comply with the finite velocity of signals in a relativistic context. In the case of a free 
particle the irrelevance of the origin of space and time coordinates implies, furthermore, that L 
should only depend on first derivatives and that in the limit |v| «<c we should recover L = 4mv? 
up to a total derivative in time. 

It is clear that L = ads/dt satisfies (1) and (2) with the constant a given by the nonrelativistic 
limit. We thus have 


(1-26) 


Momentum and energy follow from the general expressions 
aL mv mc? 
| E=p:v-L=———— 
ov Ji —v/e? /1 =e 


and agree, of course, with (1-25). The free equation of motion is dp/dt = 0, in such a way that the 
generalized force f" = dp"/dt vanishes. The action along the trajectory 

1(2, 1) = —me(s2 — 81) = —mce?[c?(t2 — t1)? — (X2 — x;)?]*? 
is a Lorentz scalar. The standard relations 


A1(2,1)_ 
io 


A1(2,1) _ 


OX> 


View P2 =P 
mean that the relativistic momentum —61/0x, = p* is a four-vector. 
An alternative form of the action may be used: 


2m [dx(t) F 
(= -| an ae | (1-27) 


where t is a priori an arbitrary parameter along the trajectory. The equation of motion obtained 
by requiring J to be stationary, m d’x“/dt? = 0, implies that t is proportional to proper time and 
can be chosen such that u? = (dx/dt)* = c*. In the following we shall set ¢ = 1, unless otherwise 
stated. 


One of the virtues of the Lagrange-Hamilton formalism is to suggest a wide 
class of transformations leaving the structure of the equations of motion invariant. 
These go beyond simple reparametrizations of the configuration space since they 
naturally mix positions and momenta. A transformation (p, q) <(p’, q’) (possibly 
time dependent) is canonical if there exists a function H’ of p’, q' (and perhaps t) 
such that in terms of the new variables the equations of motion also read 


OH . OH’ 
op’ = 0q' 
where indices have again been suppressed. A sufficient condition for this property 


is suggested by the least-action principle. We require that the differential forms 
p' dq’ — H' dt and p dq — H dt differ at most by a total differential. In turn this 
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means the equality of the external derivatives of these forms: 


>, dp; A dqi— dH” dt = ¥ dp; » dai — dH’ n dt (1-28) 


This condition implies that functions on phase space have equal Poisson 
brackets expressed in terms of old (q, p) or new (q’, p’) variables: 


G ty «of ot) Of og of ag 
{ — —_— Oo Or ddI me ee men - 
ila » ‘er Cqi CQ CDi » Cpieq; qi Op; (ee) 


From (1-28) we can also derive equations for H’ which turn out to form an 
integrable system by virtue of (1-29): 


aT E (3 - Sa) 4 Se 4 PY 
Op; <\Opi\Op, ot) api \aq, at 


a E ce — Sie) 4 Se (C5 OP) 
Ogi m4 0qi \Cq, at Oqi \Oq, at 


A glance at (1-29) and (1-30) enables us to recognize that a typical example of 
canonical transformation is given by the solution of the equations of motion where 
(q’, p') correspond to initial data at time to and (gq, p) to the phase space position 
at time ¢. It is clear that H’ vanishes in this case and that Poisson brackets are 
left invariant. 

Assume that H is time independent. The 2N functions q’, p’ expressed in terms 
of g, p, and t are constants of motion. Eliminating t one obtains 2N — 1 (local) 
constants of motion. Only a small number of them can be extended as well-defined 
functions over all phase space. A theorem due to Poincaré states that the latter are 
associated with symmetries of the motion. 

Canonical transformations are, of course, not limited to solutions of the 
equations of motion. Their set generates an infinite group. Finally we remark that 
restricting (1-28) at fixed time and taking the Nth external power of both sides 
shows that canonical transformations preserve the measure I]; dp; “ dq; on phase 
space; this is Liouville’s theorem. 


(1-30) 


1-1-2 Electromagnetic Field as an Infinite Dynamical System 


Systems with infinitely many degrees of freedom are familiar in fluid mechanics, 
electromagnetism, solid-state physics, etc. We shall discuss here how the 
lagrangian formalism extends to the electromagnetic field. 

To obtain the Lagrange function we start from the field equations in the 
presence of fixed external sources. We use Heaviside’s units (with the Coulomb 
force given by QQ’/4nr?) and take c = 1. In terms of the electric field E, magnetic 
induction B, charge and current density p and j, Maxwell’s equations read 


(a) dip» (c) divB=0 (1-31) 


oE cB 
as —=0 
(b) curlB a. j @ curl E + at 
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Local charge conservation is expressed as 


Op oan 
— — 1-32 
a + divj=0 (1-32) 


Let us recall the physical interpretation of these equations in integral form: 


(a) | Eds = \ pdx; the flux of E through a closed surface equals the enclosed 
Ss V 
charge (Gauss’ law). 

(b) }B- ix= | (i +5) the circulation of B along a closed curve C 


bordering a surface S equals the flux through S of the sum of the usual current 
plus Maxwell’s displacement current (6E/¢t). 


(c) | B-dS = 0; the flux of B through a closed surface vanishes—magnetic 
Ss 
charges (monopoles) are absent. 
OB =o. : , 
(d) -$ Eeede — | ads: 3, 2 Varying magnetic flux generates an electromotive 
Cc Ss 


force (Faraday’s induction law). 


It is convenient to use a compact notation for vector, tensor, etc.. fields 
which exhibits relativistic covariance. With greek indices running from zero to 
three: 

xt=(t,x) j*=(p,j) 
we write 
0 -E! -E? —E 
1 3 2 
Fieve 2. eves a 2 a (1-33) 
E B 0 --B 
E> —B? B' 0 
while derivatives are abbreviated as 0/dx" = 0é,. We also define Levi-Civita’s 
antisymmetric symbol é“’*’ to be equal to 1 or —1 according to whether (yvpc) 
is an even or odd permutation of (0,1,2,3) and zero otherwise. Note that 
Euvps = —&"”?’. This symbol is used to transform an antisymmetric tensor into its 
dual, as for instance 


0 —-B' —B? —B 
B} 0 E* —E£? 
BP -E® Q ig 
B bE 0 
In words, F is obtained from F by substituting E — B, B— —E, and 


ead Fe == — FY 


Pr Ph for = 0-34 
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Under Lorentz transformations F and F transform as antisymmetric tensors. In 
particular, if we perform the boost (1-23) we obtain (c = 1) 


nx(Exn)+vxB 


EF’ =(E-n)n+ 
/1-v? 
(Bee E (1-35) 
n x (B x n)—v x 
B = (B-:a)a + —__——_ 
J/1—-v? 
where nee 
Mi 


These explicit forms are not always useful. For instance, to find the relativistic 
invariants constructed from E and B it is easier to use tensor notations and 
observe that they can only be combinations of 


Per =F Pe _ — (8? — B?) 
Fy F" = —4E°B 
Note the following identities : 
Faak* = FyaF” + 40,’ FapF™ 
Pon — gb B 


With these notations and Einstein’s summation convention, Maxwell’s equations 
take the compact form 


(1-36) 


(a) 0,F"’=j" (b) 0,F”=0 (1-37) 
while current conservation appears as a natural compatibility condition 
aie (1-38) 


To proceed, we have to identify the space coordinate x with the index i of the 
various degrees of freedom, this correspondence being such that )); > J d°x. How- 
ever, we are at first slightly embarrassed since only first-order derivatives with 
respect to time occur in Maxwell’s equations if we are to identify E(x, t) and 
B(x, f) with the configuration variables. Furthermore, Lorentz invariance does not 
appear naturally in this formulation. 

To overcome these difficulties let us first transform the equations into 
equivalent second-order ones. The trick is to introduce the four-potential A* by 
recognizing that the homogeneous set of equations (1-37b) is precisely the condition 
enabling one to write 


vi AM AY _ AY gL 
Fe = 9g"A’—O"A (1-39) 


that is, Eee VA — — B=curlA 


This is in general a local statement with a particular solution, in the vicinity of a 
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point taken as origin, of the form 


1 

A*(x) = — GAAP PAX) X, 
/0 

Such a potential is, however, not uniquely defined by (1-39). It can be modified 

by the addition of a four-gradient. This is called a gauge transformation 


A*(x) + A*(x) + 0% (1-40) 


For regular fields satisfying (1-37) A“(x) can in fact be defined over all space- 
time, leading to the equivalent form of Maxwell’s equation 


DAY — 84(0,A") = j" (1-41) 


where (] stands for the d’alembertian: (1) = 0,0" = (6/dt)? — A. Equation (1-41) 
is clearly not affected by a gauge transformation. Gauge arbitrariness of electro- 
dynamics may appear sometimes annoying and sometimes a deep and far-reaching 
principle. Clever choices of gauge lead to interesting simplifications but can also 
destroy manifest covariance. 

Our first goal has been reached in the sense that we now have second-order 
equations. Compact notations may, however, be misleading. Let us therefore have 
a closer look at these equations, which read explicitly 


0 (aA° 7) 
= D)A®° —-— |—_ + di = —AA° — j 
p=U yr ( Fy div a) AA ey div A 


) , 
j= DA+ v(£4° + iva) 
For some gauge-fixing conditions, such as div A = 0 (Coulomb gauge), no time 
derivatives appear in the first of these relations (Poisson’s equation), which plays 
the role of a constraint. One may then solve for A°: 


d*x’ p(t, x’) 
Ao pee | SMES 
ion \2 |x—x’| 


The vector potential is now given by 


OA=j-V \ d°x' (6p/ot)(t, x’) 

4n  |x—x’| 
The divergence of the right-hand side vanishes, of course, due to current con- 
servation. 

The above choice is sometimes used in order to get a lagrangian formulation 
as a first step toward quantization. It has the obvious drawback of breaking 
manifest covariance even though the underlying physical picture, corresponding 
to the elimination of some irrelevant degrees of freedom, may be appealing in 
specific instances such as bound-state problems. 

We want, however, to write an action using a Lagrange function preserving 
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the local character of the theory. This is why we shall not use this gauge with 
its instantaneous Coulomb potential. Locality is a deeply rooted physical principle 
emerging from the nineteenth century formulation of field theory. Its implications 
underlie most of the developments of relativistic field theory, and its verification 
down to very small space-time intervals (via dispersion relations, for instance) 
is unquestioned up to now. 

The Lagrange function should therefore be expressed as a space integral over 
a density, the so-called lagrangian (x), which in turn should depend only on the 
fields and on finitely many of their derivatives. We do not really claim that 
these fields are directly measurable quantities ; this is obviously not the case for the 
gauge-dependent potential A,. Locally measurable quantities should, however, be 
expressed in terms of local combinations of the dynamical variables. We write 


I= [eee (1-42) 


Most of the time we shall not specify the boundary conditions of this integral. 
Let us assume that it extends throughout space and that the fields vanish sufficiently 
fast at infinity to justify integration by parts. 

We assume A“(x) to transform as a four-vector field and the lagrangian as a 
scalar density in order for the action to be a Lorentz invariant. We require, 
furthermore, that Y depends only on the fields and their first derivatives and be 
modified at most by a divergence under a gauge transformation (1-40). This is a 
natural generalization of the analogous property of the Lagrange function and 
ensures the gauge invariance of the equations of motion. 

In general, if a lagrangian depends on fields 9,(x) and their gradients 0, ;(x), 
then under an infinitesimal variation 50;(x) of the fields the action will change 


according to 
aie). OLX) sn 
r= {a fe) 6@;(x) + ARaRoGo| [ 1 Gilx)] 


OL (x) aL 
= at — 0,3 7 8 (x) 
| 0@i{x) : 8[0,.0i(x)] 
Requiring the action to be stationary leads to the generalized Euler-Lagrange 
equations 


(1-43) 


81 aL (x) 5 OH) _ ne 
dpi(x) d(x)  ”“ d[d,e(x)] 
According to the rules set before, we have thus in the electromagnetic case to 
adjust the coefficients a, b, c, d, e in the lagrangian 
L(x) = 4[a0,A’G"A, + bd,A°0,A" + c(G,A4)? + dA, A" + CA," 
in such a way that Eqs. (1-44) will coincide with (1-41). Up to an overall coefficient, 
a simple calculation yields 


L(x) = BF pF = jy" +5 [6,4*)? — 0,470, 
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where F’,, is used here as a shorthand notation for the expression of the electro- 
magnetic tensor in terms of the potential. The coefficient c remains arbitrary and 
multiplies a divergence as was expected: 


(0,A")? — 0,4°0,A* = 0,[A,(g""6,A? — 6°A*)] 


This last term can therefore be omitted and we are led to the action 


E? — B? 
les | assar? + pA) [a(2 = 


In the presence of a given external current this lagrangian is not gauge 
invariant. Under a gauge transformation (1-40) it picks an additional contribution 
which is a divergence by virtue of current conservation : 

j,0"o = O"( Pin) 
This fact explains the invariance of Maxwell’s equations. We conclude that current 
conservation is a necessary and sufficient condition for the gauge invariance of 
the theory. 

Since this question is of the utmost importance in the quantum case, it is 
interesting to present another point of view. Up to now we did not restrict the 


arbitrariness on A’. The structure of Eq. (1-41) naturally suggests the Lorentz 
condition 


— pA® ria] (1-45) 


6, A(x) —0 (1-46) 
in which case (1-41) reduces to 
At =" (1-47) 


This is, of course, compatible with (1-46) and restricts severely the gauge 
arbitrariness, since now only those transformations A" + A“ + 6“@ with 


O¢ =0 (1-48) 


are allowed. 

The Lorentz constraint can be incorporated in the formalism using a Lagrange 
multiplier 1 to add a term Af d*x3(d- A)? to the action. In this way Maxwell’s 
equations are modified and the new equations read 


LIA* + (A — 1)0*0-A = j* (1-49) 
Taking the divergence of both sides we see that, for 1 4 0, 
Ld, A" = 0 (1-50) 


Hence if 0,,A” vanishes for large |t| it will be guaranteed to vanish at all times, 
and (1-49) will therefore be equivalent to Maxwell’s equations in the Lorentz gauge. 
Let us summarize the logic: 


1. & = —4F? —j- A. Gauge invariance current conservation. Least-action 
principle + Maxwell’s equations. 
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2. f= —4F* +940- Al —j-A and @-j=0. Least-action principle > 
LJ¢,A* = 0. Boundary conditions > Lorentz gauge -> Maxwell’s equations. 


Simple remarks enable one to remember the signs in (1-45) irrespective of the conventions. The 
“kinetic” terms in (@A/dt)* should appear with a factor + 4. Moreover, since p°A° is a potential energy 
it has to occur with a minus sign. The rest follows from Lorentz invariance. Note also that in 
rationalized units J d>x(E7/2) is an energy and therefore J d*x(E2/2) has indeed the dimension of an 
action: energy x time. 


1-1-3 Electromagnetic Interaction of a Point Particle 


The four-vector current of a charged point particle is localized on its space-time 
trajectory x"(t). Therefore 


a (sae ie =, 
J vet ae d°Ly — x(t)] |r=x%2) = € ae 6*[y — x(z)] (1-51) 
with e denoting the conserved charged J d°y jt, y) = e. The current j” as expressed 
by (1-51) fulfills the continuity equation 0, j" = 0. 

For the combined system particle + field we simply add the actions 


y 2 2 
a ie +): 4) —m {as =— [a ~ —e | dxeatLoto —m [as 
= [as L(x) + | am, /1 —v? + eA-v—eA’") (1-52) 


Any charged particle acts as a source, thus modifying the surrounding field. 
As a first approximation we shall neglect this effect and assume A, fixed by 
external conditions. This allows us to drop the contribution of ¥,,, in (1-52). 
Observe that we take in this case an opposite view to the one followed at the 
beginning of this discussion. Genuinely coupled systems, whether classical or 
quantum mechanical, are obviously difficult to study. It is a fair practice to first 
investigate simplified models before looking at the more realistic ones. In Sec. 
1-3-2, we shall briefly investigate a classical coupled system. We are thus led to 
study the motion of a particle under the action of a given external field. Its 
Lagrange function is 


L=-— m,/1 — v? + e(A:v— A®) (1-53) 
Wesee that its conjugate momentum p differs from the free value mv/\/ 1 — v’, since 


ee (ss 


vv /i-v 
Hamilton’s function 1s expressed as 


H= ———————— + eA° = [m? + (p — eA)*]'7 + eA° (1-55) 


ov /1-v? 
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while the equation of motion (d/dt)(6L/dv) = 0L/éx takes the form 


d mv OA 
= —¢ — — CVAD + ey x curl A= ei + ¥ xB) (156) 
dt /{ — y? ° at ( 


which is nothing but the Lorentz force law. Finally the variation of energy & is 


dé ad m d my 
Ge * | et | eee oe 1-57 
dt “( ; =} “fae cee 


a well-known result expressing the fact that only electric fields do work. Equations 
(1-56) and (1-57) can be written compactly as 
u 
cic) (1-58) 
dt 
with u* = dx"/dt being the four-velocity. 
The reader can verify that the variational principle applied to the action 
I = —Jdtm(u/2) — J eA: dx leads directly to (1-58). 


Let us now study three simple examples : 


1. Motion in a constant uniform field 
Let F,,, be independent of x. Equations (1-56) and (1-57) lead at once to 


my MVo 


ee 


We can also solve directly the covariant Eq. (1-58) in matrix form: 


=e(Et+xxB) €—6&),=eE-x (1-59) 


et 
Tl 63 (exp —F aur 
m 
Finally it is instructive to use a spinorial representation. For a four-vector u we introduce the 2x2 
matrix : 
poe Uuol +usa 


where o stand for the Pauli matrices. Using the identity c-ao-b=(a-b)I + ia: (a x b) we rewrite 
(1-58) as 


(1-60) 


from which it follows that 


E + iB -i 
vey mean (EB. oem (# #=2..) 


If n stands for the complex three-vector E + iB and a = (e/2m)(n7)!/2 we find that 


et F no 
exp E (E + iB): | = cosh (at)I + sinh (art) (ay 
In the frame where u(0) = J and x(0)=0 we obtain in the case where E- B= 0, i.e., when the 
parameter a is real, 
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Vo 


(a) (b) 


Figure 1-1 Motion in a constant uniform field. (a) E = 0. (b) B=0; E-¥o = 0. 


2 2 
u(t) = cosh’ (at) + sinh? (at) E + | 


y E 
+ [2 sinh (at) cosh (at) ——— + 2 sinh? (at) : I] ue 


(n2)1/2 n 
; 2 2 
Gare |: _ (= (2at) _ s(1 : E?7+B )] é (1-61) 
i 4a 2 n? 
Fi [a (2at)-1 E sinh (2at) — 2at E x =| - 
2a (n?)1/2 2a n? 


In the limiting case where n? = 0, that is, E- B = 0, E* — B? = 0, we find 


pie? 
= eX B 
igo 


e*E? 7? é 
= u(t) =t— + 
2 m 


uo(t) = 1+ —; 
m 
Note that the velocity grows faster in the E x B direction. If E-B vanishes but E? — B? #0 we 
can in fact pick a frame where either E or B vanishes. If E=0 the particle describes a helix 


with an axis parallel to B, at constant angular velocity w =(eB/m),/1 —v’. If B vanishes and 
Vo‘ E = 0 the trajectory is a “catenary” in the (vo, E) plane with its concavity in the eE direction 
(this catenary reduces to a parabola in the nonrelativistic limit) (see Fig. 1-1). 


2. Gyromagnetic ratio. Thomas precession. Bargmann-Michel-Telegdi equation 
We introduce the concept of intrinsic magnetic moment and gyromagnetic ratio in this classical 
picture. This is a subject of controversy if one requires a consistent theory. Suffice it to say that 
we simply consider here a useful limiting situation of a complete quantum treatment. 

Recall that an elementary current loop is equivalent to a magnetic moment dp = id&, where i 
is the current and d% = nd& stands for a vector normal to the plane of the loop of magnitude 


equal to the area. When the current is generated by a nonrelativistic orbiting charge, i = ev/2nr: 
e 
eB (1-62) 
2m 


where L = r X pis the orbital angular momentum. Equivalently in a homogeneous external magnetic 
induction B the interaction part of the Lagrange function can be given the form 


Ling = @A+V = 5(B X1)-¥= 4B (1-63) 
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From the Lorentz force law 
at irk (1-64) 
dt 2m 


we learn that both the angular momentum and the magnetic moment precess around the magnetic 
field with the classical Larmor frequency w = eB/2m. 

In 1926 Uhlenbeck and Goudsmit introduced the idea of electron spin (intrinsic angular 
momentum) and magnetic moment, with p = g(e/2m)S(jS} = fy2; in modern language, spin 3) and g 
(reduced gyromagnetic ratio) equal to 2, to reproduce the Zeeman splitting. Unfortunately this same 
value, g = 2, seemed to produce, for an electron moving in a central potential, a spin-orbit coupling 
twice as large as the one required by the fine structure of hydrogen. Let us assume that 

ds 
eae fh Bes (1-65) 


holds in the rest frame of the electron. If E and B are the fields in the laboratory frame where 
the electron has velocity v, then from (1-35) Brest = B — v x E + O(c’). Therefore, in the small velocity 
approximation the magnetic interaction energy due to spin would be 


U' = —p:(B-y xE) (1-66) 


If the electric field of an atomic nucleus is taken to derive from a spherical mean potential 


E= —- — 1-67 
. r dr ( ) 
the above energy reads 
ge g lay 
"= -—-—S§-B+ -——;S:L- — 1-68 
a 2m i 2m? r dr i 


The catch, as first shown by Thomas, comes from the incorrect treatment of Lorentz trans- 
formations due to rotational motion. In other words, a pure boost from the laboratory inertial frame 
leads to a frame rotating with angular velocity w; so that the correct energy must be written 


U =U —S-a, (1-69) 


This is a typical relativistic effect which can be obtained by considering the product of the two 
boosts of velocities —v and v + dv. Here dv =v or. As any Lorentz transformation this product 
can be factorized into a boost of velocity Av given by 


Avy aa E + SB t)s pli | 
— v — 
J/1l-v? /\—v? v 


and a three-dimensional rotation 


1 xd 
Ao = @7 6t = +1)" ad ~4vxvoét 
1—v? v 
In the present case 
r dV L idVv 
ST as 
, ‘ (1-70) 
-1 
and u--£s-n+(! r)s-1. 
2m 2m ip (ihe 


If g = 2 this effect indeed reduces by half the spin-orbit coupling in agreement with observation. 
Dirac’s theory (Chap. 2) leads naturally to this g value for an electron or muon. But of course other 
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spin 4 particles, such as the proton (g = 5.59) or the neutron (g = — 3.83), have very different g values 
due to internal structure. 

Let us return to the motion of a spinning particle in a constant magnetic field at small velocities. 
The velocity precesses with an angular frequency eB/m while for spin the corresponding frequency 
is geB/2m. In one period the relative phase will therefore be 272(g/2 — 1)eB/m. Bargmann, Michel, 
and Telegdi have obtained a relativistic description of this classical motion of spin valid in slowly 
varying external fields. 

We represent the spin degrees of freedom by a three-vector S in the rest frame of the particle. 
In covariant notations it is therefore described by a space-like four-vector S* orthogonal to the 
velocity u“. We want to generalize the equation dS/dr = g(e/2m)S x B valid in the rest frame where 
S* = (0,8). To preserve the condition S + u = 0, we should have S +4 + S + u = Ousing dots for derivatives 
with respect to proper time. In the instantaneous rest frame, u = (1,0); therefore $° = S-a and 
y4 = [S+ tt, (ge/2m)S x B]. 

To express this in any frame, we observe that F“,S” reduces to (E-S, S x B) in the rest frame. 
Therefore the required equation is 


je FE poy © (2 _ 1 user gut) (1-71) 
2 m \2 


From (1-71) it is clear that if g = 2, S and u move rigidly. This is not the case when g # 2. Thus we 
derive a method to measure the particle’s magnetic anomaly g/2 — 1. 

Define in the laboratory plane with time axis ¢ = (1,0) the four-vector L with L along u and 
L-u =O, that is, L =(u cosh  — f)/sinh y, cosh g = u:t = u°. Furthermore, let M be a unit four- 
vector in the (S, L) plane, orthogonal touwand L. We set S = L cos 0 + M sin 0 (see Fig. 1-2). Let us find 
from (1-71) the equation of motion for @: 


S = 6(—Lsin 6 + Mcos 6) + Lcos@+M sin 0 


(F- Loos 0+ F-Msin 0) += (5 — tuk: F-ncos 0+ M-F-usin0) — (1-72) 
nm 


ge 
2m 


Taking the scalar product with M and using L: M =u:L=u-+M =O we find 
EO MO Laer PL, 
2m 
Now M-L=coth ou: M = (e/m)coth gM: F- u, leading to 


d= (cot oa F-u—$M-F-t) (1-73) 
m 


M 
S 
B 
L 
6 Ju 
(a) (b) 


Figure 1-2 (a) Rest frame of a particle with spin S. (b) Motion in a pure magnetic field. 
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or explicitly 


ga | (ear y — cosh? o) aM + (3 = 1) coth ¢o(M x B)-u (1-74) 
ae WAZ sinh @ 2 ; 


In the case of a pure magnetic field with the particle orbiting on a circle at the (proper) Larmor 
frequency w = eB/m, the mixed product (u x M- B) equals sinh gB. Thus 


6= (- 1) cosh Q 
2 
6—O@= (3 - tor cosh @ 


For a period At = 22/w, 
g g oC 
AO | period = (2 = 1)on cosh 9 = (: - 1)an —; (1-75) 


3. Motion in a plane wave 

Finally, we study the motion of a charged spinless particle in the electromagnetic field of a plane 
wave, which we assume linearly polarized for simplicity. The wave is characterized by its light-like 
propagation vector n# and polarization e“. These two vectors are such that 


W=e-n=0 e=—1 (1-76) 


The potential depends on an arbitrary function of the variable € = n- x: 


A"(x) = e"f (C) (1-77) 
From (1-76) it follows that 0- A = 0, while the field 
F(x) = (n*e” — n’e") f’(0) (1-78) 


satisfies 
FY,FY = FY FY = nn’f'(£)? 
F*,F4” = 0 
| leo) 
Therefore |E| = |B| and E- B = 0. Since n, F”” = n,F“’ = 0, the classical Lorentz equation 


du* ey 
m an = eF",u (1-79) 


leads to u+ n = constant. Choosing coordinates such that at t = 0, x(0) = 
eo — nn O)eint 
The variable can be used in place of t in (1-79) in such a way that 


du* 
rT =£ yore re = 


Multiplying by ¢ and integrating gives 
(2 
u-e=u0)-e+— [f()-f0)] 


which can be inserted in du“/dé, leading to 
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am € &-u(0) + (e/m)[ f(E)—F(0)] 
ae on if Ge a —e et (1-80) 
A last integration gives u(é) as 

2) = uo) -+ <P #12) — tl — | e nt 

u#(2) = u*(0) + - [F(a —. fon 10) en ee [F(2) — F(0)]? nO) (1-81) 
If we assume that the wave is damped at large ¢, we find 

EM. e .. g-u(0) e? f*(0) 
Alea) err” (0 : . : 
mami m IO E uO) 2m? n° LO |e hs) 


We note the appearance of nonlinear effects (terms in f 7). 
For a monochromatic plane wave such as f(¢) = a sin €, for instance, we can compute x* ina 
straightforward way : 


8) = wr — 78 a Es 
m 


Ht se a? 
n-u(0) = a Sei ,| 20 u(O) sin? e+] rom (€ —4sin 20) (1-83) 


The action is expressed as 


1(¢) = - | aeom + eA-u) 
0 


|, ae se e{e-uoy+ = 706) -s0)} 
a m 

—mu({co)* x — [ie nic 
ne 


0 


e* A? 
-[ea- u(co) — | (1-84) 
2m 
In the last form we have used u(sc) defined by (1-82) under the assumption that f(¢) vanishes at 
infinity (recall that n- wis conserved). From (1-84) we can obtain the conjugate momentum as 
d1(¢) 


n# eee 
= — = mu"(co) + ——| eA: uo) -— A (1-85) 
OXy n-u 2m 


For certain potentials it is meaningful to define averages such that A =0 [for instance if f(€) = 
e7 "ls| g(é) where g is periodic and y + 0]. We then derive from (1-85) that 


n? = m? — e? A? = m? + e? | A?| = mM of (1-86) 


This last formula is to be interpreted with care because of the anisotropies arising from the plane 
wave. It shows, however, that a particle in a strong periodic field will respond to external perturbations 


with a larger inertia. 
Linear accelerators provide a typical instance of such a motion of electrons in a traveling wave. 


1-2 SYMMETRIES AND CONSERVATION LAWS 


1-2-1 Fundamental Invariants 


We return to systems with finitely many degrees of freedom. The main task is, of 
course, to solve the equations of motion, with appropriate boundary conditions. 
General properties of the motion such as symmetries are helpful since they simplify 
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the calculations. They can also be used to restrict the class of dynamical models. 
We have already seen examples of this type with Lorentz invariance. 

Symmetries may play a double role. They enable one to generate families of 
solutions from a given one if some transformations leave the dynamical equations 
invariant. Or they lead to the conservation of quantities such as charge, energy, 
momentum, etc. The deep connection between these two aspects is our present 
subject. We start from a very simple example. A nonrelativistic point particle 
moves in a force field deriving from a time-independent potential. The position 
and velocity at time ft given initial conditions at time zero are, of course, the 
same as those at time t+1t if the same initial conditions were given 
at time t. The problem is invariant under time translation. We also know that 
in such a case energy (ie. the value of Hamilton’s function) is conserved. Let 
us see how the two properties are related. Any function on phase space varies 
along the motion according to 


iigme nner, : 
at eee 


Time-translation invariance is equivalent to the statement CH/¢ét = 0. Since one 
also has {H, H} = 0 it indeed follows that 


dH 

dt 
and energy is conserved. This simple remark is sufficient, for instance, to find the 
motion explicitly if the particle is restricted to move in one dimension. 


Alternatively, consider the action computed along the stationary trajectory 
leading from (q;, t;) to (q2, t2). Invariance under time translation means 


0 (1-87) 


I(q2, tz + T; Qi, t1 + tT) = (qa, to; qi, ti) 


or in differential form 


ae 0 1-88 
Ot. Oty So! 

Taking into account Eq. (1-12) this is indeed 
Az = A, (1-89) 


The conservation law clearly follows from the existence of a continuous invariance 
group. Similarly, for space translations 


I({qn + abo, t23 {dn + Biota) = I({@n}2 to} \@upa, ti) 


Recalling again Eq. (1-12) and differentiating with respect to a, we get the con- 
servation law of total momentum 


()) Pn)2 = (0) Pn) (1-90) 


The previous examples can be thought of as special cases of the formula 
giving the variation of the stationary action when an external parameter « is 
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changed, Eq. (1-13). Indeed, « could be chosen as the parameter characterizing 
the transformation. 

If we also have rotational invariance, consider an infinitesimal transformation 
of angle da around the axis n: 


q->q+oanxq 
The very same formula 51(2, 1) = p dq — H 6t |? leads to 


> Pr (a X qn) |2 = Y pa (m X Gu) | 


Since n is arbitrary we therefore find the conservation of total angular momentum 
as 


> qn X Pr |2 = y qn * Pr |i (1-91) 


Of course, if we only have rotational invariance around an axis the corresponding 
component of angular momentum will be the only one conserved. 

In summary, when the dynamical problem admits a symmetry the stationary 
actions /(2, 1) and /(2’, 1’) are equal, where primes denote transformed boundary 
conditions. Whenever the transformations form a continuous group we obtain a 
conservation law by differentiating with respect to the group parameters. 

However, symmetries need not be continuous. Such is the case for parity, 
time reversal, etc. For instance, in the latter case we have I(q2,t2; 41, t1)= 
I(q1, —t1; 42, —t2) where boundary conditions are interchanged and time is 
reversed, corresponding to p, < — p,. Of course, this invariance does not lead to 
a conservation law. 

We finally cast the above considerations in a form suitable for later 
generalizations. To be specific we return, for instance, to rotational invariance. 
Under an infinitesimal rotation q goes into Rqg=q+ dan x q. The Lagrange 
function is assumed invariant when R is time independent. However, the least- 
action principle allows us to choose variations around the stationary path Q(t) 
of the form 


6q(t) = da(t)n x Q(t) (1-92) 
provided da(t2) = da(t,) = 0. Using the invariance of L when 6a is constant it 
follows that 


61(Q) _ rep be = 
da(t) — dt 0da(t) 


Hence the conserved quantity is proportional to 
_ OL 064; 
Sag 0g; 0da(t) = 2m 


since 04;/05a = 0q;/0da. In the present case this is, of course, the component of 
angular momentum along the direction n. 


=e (1-93) 
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For time translations the above method is to be used with care. If the infinitesimal parameter Sa 
is constant, initial and final times are displaced. We set dq = da(t)g, 64 = 0%g + 6aq with 
da(t1) = da(t2) = 0. In the neighborhood of the real trajectory 


O= 61 [ a[e (a5 + a5) +08 | 
= = OL — — — 
i 4 5g 434 aq 4 


fi aesa[$ (0a) -2| 
=I \ phe as aa) ot 


Invariance means generally that @L/@t vanishes, in which case energy H = pq — L is conserved; 
more generally (d/dt)H = —0L/dt. 


It may occur that the equations of motion are invariant but not the Lagrange 
function. Under an infinitesimal time-independent transformation, L is modified 
through the addition of a total derivative in time. In other words, 6L/Cdx = (d/dt)p 
and (d/dt)(0L/d6a — ¢) = 0 for a time-dependent 6x. Thus we find a conserved 
quantity which is not 0L/06% any more and explicitly depends on time. 


As an example, the dynamics of a particle moving under a constant force is translation invariant 
but momentum is, of course, not conserved. The Lagrange function is L = 4mq? + q° F(t). Under a 
translation da(t), dL = F-da = (d/de) J‘ de’ F(t’) da(t’). The integral of the motion is therefore 
mq — Ji, dt’ F(t’) = constant in agreement with naive expectations. Note that even for a constant F 
this is explicitly time dependent. Physically it is, of course, impossible to create such a force 
throughout all space as required by translational invariance. 


Extending these relations to infinite systems will not create difficulties. We 
shall distinguish two types of symmetries. The first one will correspond to 
geometrical transformations of space and time under which the lagrangian Y(x) 
will go into Y(x’), where x’ is the transformed point (Sec. 1-2-2). The second 
type will leave the lagrangian invariant and will be called internal (Sec. 1-2-3). 
Symmetries play such a fundamental role that we shall devote Chap. 11 to a 
deeper study. 


1-2-2 Energy Momentum Tensor 


For an infinite system we assume a lagrangian depending on the space-time 
coordinates x only through fields and their gradients. Under a translation we 
therefore have 


L(x + a) = LO (x + a), 0,bi(x + a)] (1-94) 
Consider an infinitesimal x-dependent transformation 
6g; = da"(x)0,i(x) 
50 nhi(x) = 50°06 ,i(x) + O,[ 6a"(x)]0,6;(x) 
After an integration by parts the corresponding variation of the action is 


4 OL v 
Of {a fe - o,| soo adi) da”(x) (1-96) 


(1-95) 
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We obtain in this way a generalization of our previous discussion of energy 
momentum in a completely local form. From the vanishing of d/ for arbitrary 


da‘(x), we deduce that the energy momentum flow described by the canonical 
tensor 


~ oF 
0” = ——__ 0,6; — gyww¥L 1-97 
(6,0) ) Gu ( ) 


satisfies the conservation law 


0,0" = 0 (1-98) 


(From this derivation the two indices of ©“” play a dissimilar role.) It follows 
that the four quantities P” corresponding to total energy (v = 0) and three- 
momentum (v = 1, 2, 3): 


Pp’ = | d?x ©°"(x, t) (1-99) 
are time independent, since 
3 
Pp’ = fax 690(x, t) = — [ex Y 40x, t) = 0 
i=1 


provided that the fields vanish sufficiently rapidly for large arguments, i.e., no 
energy Or momentum escapes at infinity. 

This result is a typical case of Noether’s theorem. The latter states that to 
any continuous one-parameter set of invariances of the lagrangian is associated 
a local conserved current. Integrating the fourth component of this current over 
three-space generates a conserved “charge.” Invariance of the lagrangian means 
in this geometrical context that we also allow the possible transformation of the 
space-time argument as appears, for instance, in (1-94). Furthermore, the space 
integral defining the charge can be performed on a space-like surface o with a 
surface element do, without affecting the result : 


py | ao, 6" (1-100) 

It can, of course, happen that the lagrangian also depends explicitly on the co- 
ordinates x, in which case (1-94) is no longer valid and Eq. (1-98) is replaced by 

8,0" = — Brepicit £ (1-101) 


Such is the case if Y is a sum of an invariant part Yo plus a coupling term 
to external sources j;(x) of the form £, = L; ¢;(x)ji(x). The energy momentum 
tensor gets a contribution ©4” from Lp plus an added term from 7; and reads 


6 = Ot — g”, (1-102) 
with 
On 6” = = Pi0ji 
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according to (1-101). This last equation can be rewritten as 


8,08" = Yj) 0° (1-103) 


These two ways of expressing the local variation of energy and momentum differ 
according to whether the interaction energy — J d°x Y, is included or not in the 
system. 

Let us apply this to the electromagnetic field coupled to an external conserved 
current j where the lagrangian is given by 


= —4F?—j-A (1-104) 
and On" = O#" 4+ gj A 


i (1-105) 
Os = —F*?0°A, + 4g" F? 


The tensor © is not gauge invariant even if j is zero! Under a transformation 
A>A+ 06: 


O” + O” + 4,(g"j°b — F*? d’d) — j* a" (1-106) 


In the absence of external sources the added term is a divergence and will not 
contribute to the value of the total energy momentum if the fields vanish at 
infinity. 

The energy momentum density is in principle measurable and, moreover, is 
coupled to the gravitational field. It is therefore very unpleasant to find a gauge- 
dependent expression. In addition, the antisymmetric part of ©“” is nonvanishing: 


©” — O” = F’? GHA, — F*" 0A, (1-107) 


We know that the lagrangian is not entirely determined by the equations of 
motion. Correspondingly, the energy momentum tensor admits some arbitrariness. 
Generally speaking, let k(x) denote a conserved current. Then the charge 
K = J d>x k°(x, t) and the local conservation law are unchanged if k# goes into 


k* — k* + Ak# (1-108) 

provided that 
0, Ak* = 0 (1-109) 
and jer Ak°(x, t) = 0 (1-110) 


A solution to these constraints is given by 
Ake = 0 sk"? (1-111) 


where k"? is antisymmetric and depends locally on the fields. Indeed, (1-109) is 
verified and so is (1-110), since 


3 
[ex 09k? = [ex Yak! = 0 (1-112) 
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Returning to the electromagnetic energy momentum tensor, we see that we can 
add to the canonical ©*” a piece 


AO” = 6,0" (1-113) 


where ©#** is antisymmetric in (1, 0) and local in the fields. The discussion of 
the gauge dependence where 


Ana” = =Car 0’) (1-114) 
suggests the compensating term 
AO” = 0,(F"?A”) (1-115) 
We therefore set 
O* = © + O,(F"A”) = g*(AF? + j- A) — FH?G"A, + 0,(F#?AY) (1-116) 
Using Maxwell’s equations, we get 0,F"° = —j*, so that an alternative expression 
for ©” is 
O” = ig F? + FHF Y + gj. A — je A” (1-117) 
In the absence of sources this energy momentum density enjoys the following 
properties: it is gauge invariant, conserved, symmetric, and traceless. According 
to the general framework, if j is nonvanishing then 
6,0” = 6,0" = A, a"? (1-118) 
This can be also stated as 
6,08" = ,(hg!"F? + FHPF,") = j,F™ (1-119) 
The new tensor © can thus be identified with the pure electromagnetic contri- 
bution. The energy density ©3° = 3(E? + B’) is positive and the momentum 
density O° = (E x B)' is the familiar Poynting vector. We recover the known 
result that 


oral ix 08° = | adS-(e «B)+ | Px Ei 
dt Jy s 


After these manipulations ©*” is still gauge dependent in the presence of sources. This should not 
be too surprising because the system is an open one. To see what happens in a closed system, 
taking the sources into account, consider a set of charged point particles interacting electromagnetically. 
The total action reads 


fos [ar i), oe X2(ta) + CnXn(Tn)* Acx)| (1-120) 


It is invariant under a space-time translation 
A(x) A(x+a) — Xn(Tn) > Xn(th) — (1-121) 
If we consider infinitesimal variations of the form: 


5A"(x) = 6a’(x) 0,A"(x)  dx¥ = —da"(xn) (1-122) 
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we find 
_2F _ 5 6 =0 (1-123) 
6a,(x) 
where ©""(x) = 4g"’'F? — F#?a"A, + | aaenst(ea 5*[ x — xq(tn)] A’ [Xn(Ta) ] 
te y Mn | cwittnbted 64 [x = Kala) (1-124) 


Repeating the steps leading to a gauge-invariant tensor, i.e., adding 0,(F"’A"), we get 
@"” = 4g"F? + FF’ + Ym, | acotts 6*[ x — Xna(tn) J (1-125) 


All unpleasant terms have disappeared leaving a gauge-invariant, symmetric, and conserved total 
energy momentum tensor. Observe that even though the dynamical equations couple the material 
points to the fields, their contributions appear additively in (1-125). 


Let us now study the consequences of Lorentz invariance about a fixed point 
to generalize the conservation of angular momentum. In infinitesimal form 


Ox = 0M" x, (1-126) 


with dw” antisymmetric. We observe that the lagrangian is not, in general, 
invariant under such transformation on the arguments alone, since they should be 
accompanied by corresponding transformations on the fields. In other words, the 
fields carry a representation of the homogeneous Lorentz group. The elements A 
of the latter transform them according to 


“o(x) = S(A)p(A~ *x) (12107) 


We have collected the fields ¢ in a column vector and S(A) is a matrix representa- 
tion of the group. For A close to unity, A~ ‘x will be equal to x — 6x with 6x 
given by (1-126), and S(A) will assume a corresponding form. We leave it as an 
exercise for the reader to study the case of the electromagnetic field as an example 
of such a circumstance, and we will limit ourselves to the simpler situation where 
the fields are scalars [S(A) = 1]. To apply Noether’s theorem we have to consider 
dow” in (1-126) as functions of x and to identify their coefficient in the variation 
of the action, with due care paid to the fact that dw" = —dqm"". If ©”” stands 
for the symmetric energy momentum tensor, one finds the conservation of the 
generalized angular momentum: 
Jer? = @Y’x? — @#Px” 

(1-128) 
0,3 ae 


This represents only the orbital part in the case where the fields transform according 
to nontrivial representations of the Lorentz group. Additional parts corresponding 
to the internal contributions appear in J”? to build the conserved quantity. In 
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Chaps. 2 and 3 we shall deal explicitly with such instances. It is crucial that @”” 
be symmetric for (1-128) to yield such conserved densities and time-independent 
associated charges (six in number): 


je fax J°¥°(x, t) (1-129) 


Note that J’? is not translationally invariant. Under a displacement a” of the 
origin of coordinates, the orbital part varies by an amount a’P? — a’P’. To obtain 
what really deserves to be called the intrinsic angular momentum we construct 
the Pauli-Lubanski vector 
Japs 
W,= Teenie (1-130) 
which reduces in a rest frame (P = 0) to ordinary three-dimensional angular 
momentum. 
Up to now in this relativistic context we have avoided appealing to the 
hamiltonian formalism or introducing Poisson brackets. The reason is that we did 
not want time to play a special role. However, nothing prevents us from doing 


so. At a given time t and for a generic (x,t) we can associate the conjugate 
field 


ay See. 
00o9(X,t) d0o¢(x, t) 
Similarly, we define the Poisson bracket of two functionals L,; and L2 of the 


fields g and z at time t as follows: 


OL; OL, OL, 6L2 


 — fe he 1-132 
(ieee = — \Tanks} ars] ee oe ea ea Ce 


m(x, t) = [eyeo, t) (1-131) 


These functional derivatives are to be taken with a grain of salt. For instance, 
if L is expressed as a space integral over a density involving gradients of the 
fields, a suitable integration by parts is always understood. In particular, 


{n(x, t), p(y, t)} = 6°(x — y) 


(1-133) 
{n(x, t), x(y, t)} = {o(x, 2), oly, t)} = 0 
and, for instance, 
| {n(x, t), Voly, t)} = — Vxd°(x — y) 
It is easy to see that the field equations (1-44) take the form: 
r) 
0o9(x, t) = {H, o(x, )} =——— H 
an (1-134) 


r) 
Gon(x, t) = {H, 1(X, t)} =— Fea H 
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in complete analogy with nonrelativistic dynamics, when we have identified Hamilton’s function 
FH as the integral of the energy density : 


H= [ex ©°°(x, t) (1-135) 


expressed in terms of ¢(x, f), V(x, t), and 7(x, t). Equation (1-134) expresses the fact that H generates 
time translations of the system. The reader should find it easy to generalize it to space translations 
and infinitesimal Lorentz transformations. 


1-2-3 Internal Symmetries 


Systems such as those of particle physics are endowed with internal symmetries 
which play a prominent part in analyzing their spectrum and interactions. As far 
as we know most of these symmetries are, however, only approximate, being broken 
by forces weaker than the ones under consideration. It is nevertheless useful to 
pretend at first that these are exact invariances and to study the pattern of 
violations as a second approximation. Let us briefly derive Noether’s theorem 
in this classical context. 

Let @,°-'oy stand for N interacting fields. Some of them can be real or 
complex and carry Lorentz indices. For each value of x we consider them as a 
vector in an N-dimensional space where we are given a representation of a group 
G. The latter can leave certain subspaces invariant (such will necessarily be the 
case if we have fields having inequivalent transformation properties under the 


Lorentz group). If G is a compact Lie group we shall write T*(s = 1,...,r), the 
corresponding generators with T* antihermitian, and 
a a (a eae ta (1-136) 


Recall that the structure constants C**:,, can be chosen totally antisymmetric 
for a compact group. For brevity we do not distinguish the group from its linear 
realization on @. 

Let us consider space-time-dependent variations of the fields 


d(x) = ba,(x) T(x) 
d6'(x) = —da,(x)p"(x)TS 


under which we have #(¢) ~ L(¢ + 6¢) = Y, where F is a function of , Cu 
(and their complex conjugates), da,, and 0,6. A variation around the stationary 
solution will yield 


(1-137) 


aL OL 
O=Ol=— Va x ou — 6, —.——~ : 
| ~_ Oe i Tsay C— 
We define the corresponding currents 
; of 


Only the part of the lagrangian containing field derivatives will contribute to 
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js. Noether’s theorem follows from (1-138) and gives the divergence of these 
currents as 


0L(x) 


0, j4(x) = — : 
me(x) Ada, (1-140) 


where we have noted that 0Y (x)/Cda,(x) can be identified with the same derivative 
for constant variations dx,. A current j* will be conserved if the corresponding 
term on the right-hand side of (1-140) vanishes, which means that the original 
lagrangian is invariant under the one-dimensional subgroup of G generated by 
T,. Stated differently, to each such subgroup is associated a conserved charge 


Q, = [ax i°(x, 1) aes = [ex a jttx, t) = 0 (1-141) 


Let us return to the general case without assuming the conservation of the currents. 
This does not prevent us from defining the charges Q,(t) at time t. In what follows, 
t will be fixed and we shall not write it explicitly. If # stands for a functional 
of the fields and conjugate momenta at time t, we want to compute the Poisson 
bracket 


50,5F 60, 2m nai 


i GF — 3 pee <a e 
Qs. F} jer (2 bb, bg OT, 


The dependence of ¥ on @96a,(x) arises only from the dependence of # on 
Cod:. To simplify, let us assume that the fields are real (in which case T, is real and 
antisymmetric). If this were not the case, we would consider separately the real 
and imaginary parts. Then it follows that 


of OL 


BX) => = eb bo (x) = (x) T° (X) 
[Adu] oda] ane 
Q(x) = [ex m(x)T*P(x) 
We have, therefore, 
OF OF 
Ce Ta eS hee Sox nas 1-144 
(a. 7)= | wx (55 9 - =| 


In particular, 
{0°, o(x)} = T°o(x) —- {Q°, x(x)} = — (x) TS = T°n(x) (1-145) 
which can be rewritten 
{da,0°, (x)} = d(x) {6a,0%, 2(x)} = x(x) 


where 60(x) and 62(x) are the variations (1-137) of the fields under infinitesimal 
transformations. In other words, the charges Q*° generate these infinitesimal trans- 
formations through Poisson brackets in the same way that the hamiltonian 
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generates time translations. Similarly, we have 
{O' O=b= — [es nox) [ T°", Teer — C2, O”? (1-146) 


as should obviously be the case. The invariance of the lagrangian was not required 
to derive the above relations. Indeed, if H stands for the hamiltonian we easily 
derive that 


one aan ree . 
= (t) = {H(9, O(t)} = {a x35, OH) (1-147) 


We can generalize Eq. (1-146) to the equal-time Poisson brackets of the time 
components of the currents with the following result: 


{78(x, 1), 76(y, 0} = —C%,, 6°(x — y)je(x, 0) (1-148) 


In their quantized version these relations will give rise to the important develop- 
ments of current algebra (see Chap. 11). 


The reader will find it instructive to study similarly the equal-time Poisson brackets of the energy 
momentum tensor: ~ 


{O° (x, 1), Oy, t)} = [O*(x, t) + Oy, 1] GES°(x — y) 
{O°P(x, 2), O*(y, t)} = [OM(x, t) — gO Cy, t)] dF5°(x — y) (1-149) 
{O%(x, 2), Oy, 2)} = [OM (y, t) OF + O'(x, t) GF] 6°(x — y) 
from which we recover the Lie algebra of the Poincaré group: 
{PoP Os eee 
y pv Vyvya vit pay HyVY 7 m1 (1-150) 
Mihaela =g"J Big Hye — gt®yy — gue 


To close this section let us study a simple lagrangian involving two real fields ¢; and ¢2, 
which we shall describe using the complex (independent) quantities 


$1 + i¢ $1 — id 
g= = oft = (1-151) 
i ¥2 
Let us assume the dynamics invariant under internal rotations in the (1,2) space or, equivalently, 
under 


P(x) +e" h(x) P*(x) > e'** H*(x) (1-152) 


The constant e appearing here will be identified with the elementary coupling constant to the 
electromagnetic field, i.e., the electric charge. For instance, Y assumes the form: 


L ccatar = (On)*(0"b) — m?o*h — V(o*o) (1-153) 


with V an arbitrary smooth function, for instance, a polynomial. Noether’s theorem yields a conserved 
current 


age a 
ix) = a[oraGo] ~ iel*0, — (0,6*)] = ied* 3,6 (1-154) 


CLASSICAL THEORY 31 


We can, of course, directly verify this conservation using the equations of motion: 


ay. — pole 
(O + m’)¢ = 36" (OC + m?)o* = 26 (1-155) 
The charge 
Q= | exe t) = ie [ex o*(x, 1) doo(x, t) (1-156) 


is therefore time independent. We now wish to couple this system to the electromagnetic field. Since 
we are given a conserved current we might think of using it on the right-hand side of Maxwell’s 
equations. But some care must be exercised since the coupling of @ to the vector potential A, might 
modify the structure of the current itself. We seek therefore a total lagrangian given as a sum of 
three pieces™ Y= —4F>. ¥ scala, given by (1.153), and an interaction part ¥,4(¢, ¢*, A,): 


L= en ar scalar ar Fint (1-157) 


Let us assume that the current J” given by Noether’s theorem applied to the full lagrangian (1-157) 
is indeed the electromagnetic current. In other words, we have 


c OL int Cin 
J# = jt + ———_ = wv = — ——— 1-158 
T* afa,a(xy) 5A,(x) We) 
It is easy to convince oneself that the following interaction: 
Lint = —ieA,h* Ah + ce? A? H*h 
(1-159) 
= —Ayjt + PA*6*O 
fulfills these conditions in such a way that 
J* = j* — 2e? A"o*o (1-160) 
The full lagrangian is therefore 
? = —4F? + [(0, + ieA,)b]*[(0" + ieA")b] — m?o*o — Vib*d) (1-161) 
leading to the coupled equations 
oV 
O,F"=J"  —_[(8,, + ieA,)(@* + ieA") + m7] o = — ag" (1-162) 


We observe that (1-161) follows from a principle of minimal coupling to the electromagnetic field, 
according to which any derivative @, acting on a charged field (with charge e) has to be replaced 
by the covariant derivative 0, + ieA,. As a consequence, the lagrangian is not only invariant under 
the transformations (1-152) with constant « but under more general x-dependent (ie., local) gauge 
transformations : 


h(x) + oF) h(x) G(x) > eF) H*(x)_—— Ayx) @ A(x) + 8,00) (1-163) 


What was initially a trick to derive Noether’s theorem has now become a deeper property of electro- 
magnetic couplings which generalizes the gauge invariance already discussed for the free electro- 
magnetic field. This idea suitably extended to noncommutative groups yields very interesting model 
field theories (see Chap. 12). 
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1-3 PROPAGATION AND RADIATION 


1-3-1 Green Functions 
The dynamical equations of field theory are typically of the Klein-Gordon form: 
(Cl + m?)$(x) = jx) (1-164) 


where j may depend on the fields @ and extra indices have been omitted. We 
had already an exainple with Maxwell’s equations for the potential in the Lorentz 
gauge, where the mass term of (1-164) was absent. 

For the time being, let us assume the source j(x) to be given and m? > 0. 
We are thus dealing with an hyperbolic second-order partial differential equation, 
which determines ¢ in the neighborhood of a point x in terms of its values 
together with those of its normal derivative on a space-like surface element passing 
through x. Characteristic elements are tangent to the light cone, showing that 
causality is locally obeyed. 

In scattering theory one seldom has to tackle the problem in the way just 
mentioned. Boundary conditions on ¢ are rather imposed along space-like surfaces 
widely separated by a time-like interval. It is then useful to construct standard 
solutions to (1-164) where the right-hand side is replaced by a distribution con- 
centrated around a point x’. We shall generically denote G(x, x’) the solution of 


(Cl + m2)G(x, x’) = 64(x — x’) (1-165) 


with an appended suffix to characterize the boundary conditions imposed on G. 
The latter will most frequently be translationally invariant in such a way that the 
corresponding Green functions (or propagators) will only depend on the 
argument x — x’. From the superposition principle, solutions to (1-164) will be 
generated by 


(x) = GO(x) + | ate G(x — x')j(x’) (1-166) 


where #'°(x) obeys the homogeneous equation and is chosen in such a way that 
@ satisfies the boundary conditions. 

Making further use of translation invariance, (1-165) is solved through a 
Fourier transformation which replaces it by an algebraic equation. Setting 


1 2 
G(x — = Ge | atpern er Gip (1-167) 
we get 
(—p? + m7)G(p) = 1 (1-168) 


To divide both sides by — p* + m? we have to cope with the zero of this expression 
on the two-sheeted hyperboloid p? — m* = 0 (or the cone p? = 0 if m? = 0). This 
in turn is equivalent to prescribing in (1-167) a slightly deformed contour of 
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integration. We note that all these choices differ at most by a contribution 
9(P, Po/| Pol) 0(p? — m”) to G(p), an expression corresponding to the solution of the 
homogeneous equation. This choice is, of course, related to boundary conditions 
at infinity. 

Let us first define the retarded and advanced Green functions: 


= # 
Gret (P) ~ (po + ie) — p? — m? 
. (1-169) 
ae u |e ae 
von (2n)* (po + ie)? — p? — m? 


The ¢ prescription is equivalent to a slight contour deformation in the integration 
Over po and shows that G(x) is to be interpreted as a distribution. If we first 
perform the integral over po, in the case of G,,,(x), say, we can close the integration 
contour in the upper complex plane if x° < 0, without encountering any singularity. 
From Cauchy’s theorem we conclude that G,,,(x) vanishes for xo < 0. The opposite 
conclusion applies to G,4,(x). Itcan be checked that these distributions are Lorentz 
invariant so that G,.,(x) vanishes outside the forward light cone while G,4,(x) 
vanishes outside the backward one. These properties are in agreement with causal 
propagation. We also note that both Green functions are real, with G,g,(x) = 
Gre(— x). When m? = 0 we recover 


Gas (Dpwt=0 = 5 (#0) HC?) (1-170) 


adv 


while for m* > 0 the explicit expressions involving Bessel functions are not too 
illuminating. However, no matter what m? is, the singularity of these Green 
functions on the light cone remains given by (1-170), a reflection of the fact that 
the small x? behavior is entirely dictated by the differential operator in (1-165). 
The niass term is then responsible for the fact that the support is not concentrated 
on the light cone as in (1-170), but also involves signals propagating at a speed 
smaller than one. Returning to the integral representation (1-169) let us perform 
the po integral using the decomposition 


1 1 1 1 
(po + is? —p?—m? 20, piee ete Po + @p t+ ié 
with w, = ./p? + m’. We readily find 


eel 3 : : Nee ' 
Gale —xJ=1 ee \3 P fg leap xo — x4) + ip (x — XY. gitdelxo— x0) + iP — x} 
Dp 


We wish to give a physical interpretation of this formula. To do so, we may 
imagine the system enclosed in a very large cubic box of size L. The momentum 
integral is replaced by a Riemann sum, and each component of the momentum 
is of the form 2nn/L with n an arbitrary integer, in such a way that 
[1/(2n)?] f dp > 2/2. . 
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We then have 
Grol — x1) > 10(x0 — XO[E G4 pF p(X) — E O_plIGE p(x')] (1-171) 
The momentum p assumes only discrete (but very dense) values and the functions 
1 : 
Q (x )= etiw,xo+ip:x (1-172) 
ae BI 2/20 

stand for periodic solutions of the homogeneous Klein-Gordon equation. The 
plus or minus sign corresponds to the sign of the frequency. The relative minus 
sign in (1-171) [together with the factor i in front to build a real quantity: 
Pt p(x) = P_, —p(x)] comes from the fact that the solutions of the equations are to 
be normalized according to a nondefinite “norm” [see formula (2-5)]. We shall 

see that while gs has a positive norm, g~_ has a negative one. 
In any case the expression (1-171) leads to an interesting interpretation of 
propagation according to G,..(x — x’). Positive and negative frequencies are 
respectively propagated forward in time by the first and second term of (1-171). 


For G,qy(x — x’) we have a similar expression with x and x’ interchanged. 
The difference 


G(x) = Gra) — Gaay(x) 


“a3 |e e~ 'P** &(p0)6(p? — m?) (1-173) 


i 
~ Qn} 


is an odd function vanishing outside the light cone such that 


|Z p (ei pX0 + ip: elo pXo+ip *) 


a GOUsh eno = 0700) (1-174) 


It clearly satisfies the homogeneous Klein-Gordon equation. When m? = 0, it 
reduces to 


1 
G0) |p2=0 = = €(Xo) 6(X2) 
2a 
1-175 
= lim = | : : 
no 407i | (x — in)? (x + in)? 
where y is an infinitesimal positive time-like vector. 
Finally, the half sum 


PP 
Ola les) arenes ee ro On |e pre” Ps e — (1-176) 


is an even solution of (1-165). In (1-176) the principal part symbol applies to the 
Po integral, and we recall that 
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eee pp 5(Po — ,) 

_—= —___—_ TO(Po — 
Po 2, + te Po — @p ‘ 


5(p? = m?) = (Po = Wp) a (Do a Wp) 


20» 


(1-177) 


The above discussion takes place in a purely classical context. We shall, however, 
encounter in the quantized version another even solution to the very same 
equation, first introduced by Stueckelberg and Feynman. One reason to explain 
why it does not naturally appear in classical physics is that it is a complex 
distribution defined through 


u —ip-x 1 = 
Gr(x) = — One | d*pe = (1-178) 


It therefore satisfies 
(Cl + m’)Ge(x) = 6%(x) — Gr(—x) = Gr(x) 
If we discretize as above we find the equivalent form: 
Gr(x — x’) > i[ (xo — X0) EO +.p(X)GF p(x’) + O(X0 — Xo) Z Q— p(x) G* p(X’)] 
(1-179) 
While the previous Green functions were zero outside the light cone, this is not 


the case for Gr, which has an exponential tail for negative x*. Let xo = 0 and 
set r = |x|. Then 


’ = Otte 1/2 
ai oe eee 7) 1-180 
G-(0, r) aa (2n)?r [ dp p? = m* 4 (2n)?r? ( 2 


From (1-179) we read off that, according to Gr, positive (negative) frequencies 
propagate forward (backward) in time. This distinction according to frequency 
explains why G- is intrinsically complex. Note also that G,(p) is a meromorphic 
function of the complex variable p? in contradistinction to the previous Green 
functions. 


These expressions can be generalized to more general cases such as the electromagnetic one. Corre- 
sponding to the fact that Maxwell’s equation alone 


DA* — 04(0* A) = j* (1-181) 


is not sufficient to determine the potential A“ in terms of the conserved current j", we find in 
momentum space that the matrix 


Pe ti pps . (1-182) 


is singular. Its determinant vanishes identically. The operator L is in fact proportional to a projector 
since 


P=p'L (1-183) 


To avoid this difficulty, we can either add a small mass term or proceed as in Sec. 1-2 and introduce 


36 QUANTUM FIELD THEORY 


an extra term (A/2)(@- A)? in the lagrangian. The modified equations of motion are 
[Cg*, — (1 — a)0*8,] A” = j* (1-184) 
and the propagator becomes 


gai — een? 
p? + ie 


Gi?(x, a) = ~ or | ene (1-185) 


The apparent singularity in the numerator plays no role when G#" is convoluted with a conserved 
current. For A = 1 we recover the Feynman propagator g“”G,(x). We shall have more to say about 
this in Chap. 3. 


1-3-2 Radiation 


As an elementary application of Green functions let us recall the computation 
of the electromagnetic fields generated by a moving point charge. Let x(t) be its 
space-time trajectory and j“(y, t) the associated current given by (1-51). Causality 
requires to use the retarded Green function to solve for the potential. Moreover, 
the remarks at the end of the preceding section lead at once to the expression 
of the potential in the Lorentz gauge: 


ie 
AMY) = [ats a(y° = 2°) fy — 27 ]2) 

; (1-186) 
-2 | * de OLy® — x%(e)] {Ly — x(0)]7} 0") 


The dots refer to derivatives with respect to proper time. Let x, = x(t), the 
unique y-dependent retarded point on the trajectory, such that 


(v-x1?=0 xf <y? (1-187) 
The following identity holds: 
ad f(t+) 
| aril y®— x°(z)] o{fy=x@)]7} (@) = ———>— aie) 
— Paper (y = x+) 


From Eq. (1-187) and the fact that for a physical trajectory x, belongs to the 
forward light cone, it follows that X+ -(y — x+) > 0. We thus obtain the Lienard- 
Wiechert retarded potential : 


e xh 


A) 
ret(Y) 4n ee x (y = x4) 


(1-189) 


the relativistic generalization of the Coulomb potential to which it reduces in 
the frame where x4 = (1,0,0,0). Explicitly, ifr = y — x, and vi = (dx/dt)(t,): 


A°a(y) = 0) = (1-190) 


(B 
An(r—¥°V,) ~ 4n(r — r-v,) 


Dropping the + index, the corresponding fields read 


CLASSICAL THEORY 37 


e (r—rv)(l—v?)+r x [(r — rv) x dv/dt] 


E = 
An (r—r-v)? 


(1-191) 


and involve, of course, the retarded point. Both E and B contain terms in 1/r? 
which only contribute at short distance and 1/r radiated parts. The energy flux 
across a sphere ts given in terms of the Poynting vector as 

d& 


a= | as-exB)= | ange xe >o (1-192) 


At small velocities the relevant 1/r contributions to (1-192) are 


e dy r x dv/dt 
Evag = ane E x (« x *)| B..4q = @ - (1-193) 


which are the standard dipole fields. In the same limit the radiated power is 
given by the Larmor formula: 


a e |d@ avy e* (dv\ ae 2 () 
— =——; | = —} =—(|— == —|(— 1-194 
dt (4nx)° \ eo ( "i *) 8n (F) | dco ane’ 3 4n \dt ( ) 


These results apply to many interesting situations of which we are only going to 
consider a few. 

As an example let a charged particle oscillate nonrelativistically under the 
influence of a weak electric field of an incident plane wave of low frequency : 


dy _ a sEo en iwttik-x (1-195) 
dt m m 


Here ¢ is the polarization of the wave, Eo its amplitude, and k its wave vector. 
According to Larmor’s formula the power radiated in the solid angle dQ (see 


Direction of 
observation 


Figure 1-3 Radiation of a charged 
particle oscillating in a plane 
wave of wave vector k and 
polarization ¢. 
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Fig. 1-3 for the definition of angles) is given by 
e? dy\? dQ e? é ae. 5 
= -—; — |) a 6 dQ (1-196 
oF dae ( - *| r? ~ (any \ar) © i 


If the particle displacement during a period is only a small fraction of the wave- 
length of the incident wave, the mean value of (dv/dt)’ is 


dy \? av aye ee ’ 
(5)) sae, 7, a" 


() = : (=) Eo|? sin? 6 (1-197) 


4nm 


and thus 


The average incident energy flux per unit time and unit area perpendicular to 
the propagation is given by the average of the incident Poynting vector as 
+|Eo|?. We define the differential scattering cross section da/dQ as the ratio of 
radiated power per unit solid angle to incident power flux across a unit area: 


PL NEA 
oe = eS sin? 9 = r2 sin? 0 (1-198) 
We have introduced the notation r, for the classical electromagnetic radius 


(restoring the velocity of light c): 
2 h 
a (1-199) 


4nmc* mc 


Ve 


with a value of 2.82 x 10~'* cm for an electron, and « stands for the fine structure 
constant « = e?/4nhc ~ 1/137. The expression (1-198) is referred to as the Thomson 
scattering cross section; it is given here for an incident polarized wave. In terms 
of the angles f, d, and w defined on Fig. 1-3, we have 

sin? 9 = 1 — sin? B cos? (¢ — ) 
To obtain the differential cross section, we simply average over : 


do 
dQ 


=> 1-200 
unpolarized 2n ( 
Scattering is maximal both in the forward and backward directions. The total 
Thomson cross section is the integral of (1-200) over Q: 


8x 
Prot = 3 Ye (1-201) 
For an electron this yields 0.66 x 10~** cm’. We recall that this expression holds 
for weak fields and small frequencies w@ «mc?/h. When hw > mc? or 1 < h/mc 
(the quantum mechanical Compton wavelength), the initial frequency is no longer 
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m Figure 1-4 Bremsstrahlung. 


conserved, we encounter both quantum mechanical and relativistic effects, and the 
process is then called Compton scattering (see Chap. 5). To a good approximation 
the cross section is then given by the Klein-Nishina formula [(5-116) and (5-117)]. 
Let us only quote here the lowest-order correction in haw/mc* to Thomson 
scattering : 


h 
ou = (1 ~ G++) (1-202) 


We can study along the same lines classical bremsstrahlung, 1e., radiation 
by a charge suddenly accelerated. Let u; = p;/m and uy = p,/m be the initial and 
final four-velocities (see Fig. 1-4). Choose the origin of coordinates at the space- 
time point of acceleration, which can be thought of as the idealization of a collision 
process. The space-time trajectory is parametrized as 


LA t<O 

x(t) =2 (1-203) 
Ps came) 
m 


while the current 
d 
ix) =e | dr 54x — x(7)] 
dt 
= —- ie d*k eiksx Dis = Pi ) (1-204) 
(2n)* pik pl+k 
takes after Fourier transformation the simple expression 
i f 
a . p P y 
i(k) = ~ie( Ft oe) = j*(—k) 
We require that as t-» —oo the electromagnetic field reduces to the Coulomb 


field of the incident particle, since current conservation forbids the vanishing of 
ju(x) when t> — ©. In the Lorentz gauge we write, using the definition (1-173), 


40 QUANTUM FIELD THEORY 


A*(x) = [ate Geli = Co 
(1-205) 
= [ate G(x — x’)jX(x’) + [ate Galx — x)i"(x) 


We have exhibited the structure of A“(x) as the sum of a radiated field (solution 
of the homogeneous Maxwell equation) and a Coulomb field attached to the 
particle. Therefore 


At (x) = Je Go (x — x’)j*(x’) 
(1-206) 
= Gn om | atk eos * 8(ko)(k?)j"(k) 
while 


Fra) = (ns ap | ate e7 *** e(ko)d(k2)[k4i*(k) — k*F4(k)] (1-207) 
The density of radiated energy is given by (1-117): 


@°0 = FoR + 4g FPF, 


For a light-like vector k° = |k|, we define two orthogonal space-like polarization 
vectors &, satisfying 


e? = —1 €1°&2=0 &°k=0 (1-208) 
We readily derive that the emitted energy & at a positive time is 
1 
E= [ex @”°(t, x) = (On ap |S ko 25 | bx jC Cie (1-209) 


Let us anticipate the interpretation of this radiation in terms of light quanta, 
i.e., photons of energy momentum fk. Henceforth set h = 1. Then the energy 
emitted in a phase space element d°k will read 


leaek 


_ ge ae 
One? 


k-p) k- pt 
This semiclassical calculation enables us to obtain the number of emitted photons 
of polarization « by dividing by ko the energy of an individual quantum: 


dé ae a | 2 ee 
k kp 6k p’ | 2(22)7k° 
This result agrees, in fact, with the full quantum mechanical treatment (see Secs. 
5-2-4 and 7-2-3). By integrating over k (for small k) we note that the total energy 


is finite but the total number of photons is not. This is the infrared catastrophe 
to be discussed in greater detail later. 


2 


(1-210) 


(1-211) 
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The angular distribution given by (1-211) is of interest. In the frame where k“ =|k|(1, 1,0,0), 
et = (0,0, 1,0), e4 = (0,0,0, 1), taking into account that k -j(k) = 0 implies j° = j', we see that 


Y lee ik) |? = —j-7* (1-212) 
Therefore, 
e? 2p 5 p’ m2 m2 
dé = ee pe Se peer 
2(2n)° E “piikvenh) (ks pe (ks =| 


This can be rewritten in terms of the particle initial and final velocities, with k = k/|k| as 


, e? d?k | 21 vicwiee m? be m? | 
22a? |kPLG— kw —k-v,) ERL—k-vy? EX —k-v,? 


For the emission of soft quanta (|k|— 0) the radiation is strongly peaked in the directions of the 
initial and final velocities, a typical property of bremsstrahlung. 

To evaluate the total number of radiated photons, we integrate (1-211) from a lower value kmin, 
needed because of the infrared catastrophe, up to some maximum momentum k,,x, needed because 
of the unrealistic sharp angle in the trajectory (1-203). A typical cross section do¢oy will describe the 
collision process and the total cross section dépremz Will include the final emitted photons. To integrate 
(1-211) we introduce the notation q? = (py — pi)? = —4E? sin? 6/2 in the ultrarelativistic limit where 
E; ~ E; = E with @ the scattering angle, while in the nonrelativistic limit |v;| ~|vy;| =v. We use 
Feynman’s integral representation: 

1 
! | pee ee (1-214) 


(1-213) 


AB Jo [xA + (1 — 0B] 
to compute 
d?k(1 — vi" vy) : 2(1 — vi vy) 
a eS 
(1 —k+v,)(1 — k+v,) o 1-—[xv,+(1—x)vy] 
See ee ae 
4nt 1+ id sin? 5 + O(v*) _v «1, nonrelativistic 
- q m? eee 
fe in - z) + (=) q’ > m?, ultrarelativistic 
m q 


Furthermore, 


2 


~ m 
2k a a a 4 
{4 P= k-y? 
The final expression for the soft photon emission cross section reads 
do _ (de i kmax\ 2a ($v? sin? (0/2) nonrelativistic (1-215) 
dQ; Beane AQ 5] con : kin) ™ In (—q?/m?) — 1 ultrarelativistic 


In this quick survey of radiation problems we have neglected the interaction 
of the particle with its own emitted field. The same was true in the discussion 
of the motion in Sec. 1-3. As a matter of principle this is, of course, wrong, 
but except in extreme situations it is nonetheless a perfectly justified approximation. 
Indeed, from Larmor’s formula (1-194) the radiated energy 6 wad ~ 3(e*/4n) x 
(dv/dt)?(At/c?). As long as &,,4 is small as compared to typical energies in the 
problem, for instance, the particle energy &o ~ m[(dv/dt) At]? for a moving charge, 
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we can neglect classical radiative corrections. This leads to the condition 
2 
At >to == ——4a=5— (1-216) 


where a characteristic radiation time to has been introduced. On the contrary, 
this should become relevant if the forces vary appreciably in a time To or on 
distances of the order cto. 

The classical point charge theory due to Lorentz, including radiative cor- 
rections, was the subject of much controversy before the advent of quantum 
mechanics, which has changed the perspective considerably. Even though in most 
situations it is not the relevant approximation, it is instructive to understand 
the limitations of classical mechanics in its most elaborate treatment of point 
charges. Let us present an heuristic derivation of the correction f, to the Lorentz 
force law to take into account the self-interaction. We write 

u 


mit = m= eFY uy, +f" (1-217) 


The extra term f” should be a four-vector such that for small velocities one 
obtains from (1-217) the Larmor energy loss 


ddr, duo ene 2 é. (dv \? 
dt 


7 maar Ties ae 

in the nonrelativistic limit. Furthermore, from translation invariance it should only 
depend on u and its derivatives with respect to proper time. No new quantity 
independent of e and m with the dimension of a length should arise if the particle 
is structureless. Finally, we want to keep the definition of proper time, so that 
u? remains equal to one, from which we deduce that u-f has to vanish. 

The four-vector —(i-:u)u" = (u*)u" has a fourth component reducing to 
— (dv/dt)* in the nonrelativistic limit. The requirement f: u = 0 forces us to construct 
the combination ii” — (wu: u)u* orthogonal to u. Hence the above requirements lead 
to the classical Lorentz-Dirac equation in the form: 


2 


Dea y 
mu" = eFe uy + Ane [ui* — (u-ii)u"] (1-218) 


where F,,, represents the contribution of all external charges. The time component 
of this equation gives a relativistic generalization of the energy balance 


dé De? 
== OF oatty += ae (ui)?up + mtoti° (1-219) 


The first term on the right-hand side obviously corresponds to the work of external 
forces; the second is the dissipative (u? < 0) Larmor term. The third one (the so- 
called Schott’s energy term) is a total derivative. It can be neglected when taking 
averages over (almost) periodic motions or, more generally, when the variation 
of the acceleration is small during time intervals of order to. The original derivation 
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of (1-218) by Lorentz used a spherical model of the charge and was not free of 
objections from the relativistic standpoint. Dirac obtained the same equation in 
a fully relativistic way using local conservation of energy and momentum. Both, 
however, had to incorporate in the inertial mass an infinite (positive) contribution 
equal to the electrostatic Coulomb energy created by the charge, a typical re- 
normalization effect. If the observed mass were only electromagnetic in origin 
we would have to introduce in this Coulomb energy a short-distance cutoff a 
such that e*, 41a ~ me’, that is, a ~ cto ~ re. This is therefore the limit on distances 
where classical physics comes into difficulty. It is much smaller than the Compton 
wavelength 7 = h/me where quantum effects become important (r,/A = a ~ 107”). 
Therefore the latter always hides the small-distance classical effects. We shall see 
that rather than diverging linearly (with inverse length) as here, the “bare” mass 
of a spin 3 electron diverges only logarithmically. 

Even ignoring these infinities, we can expect difficulties at short distances or 
short times. Thus let us give a closer look at the equation of motion, omitting 
even dissipative and relativistic effects. We rewrite it in three-dimensional notation 
as 


m(¥ — to¥) = Fox: (1-220) 


In the absence of external force, apart from free-motion solutions vy = constant, 
it admits runaway self-accelerated solutions ¥ = Vo e“’. Mathematically, this arises 
from the occurrence of higher derivatives in the equations which lead to totally 
unphysical situations. To obtain sensible results we must insist on boundary 
conditions and replace (1-220) by an integro-differential equation which in- 
corporates these conditions (in particular ¥ + 0 at t > oo if F.,, vanishes in this 
limit). Then 


mv(t) -| ds e * F.,,(t +-ToS) (1-221) 
0 


In this form runaway solutions are eliminated but another unpleasant feature 
arises, namely preacceleration. If F,,, is zero for negative t (Fig. 1-5), then v does 
not vanish but rather starts increasing at earlier times of order to(~ 107 ** s for an 
electron), which is the time required for light to cross the electromagnetic radius. 
We can again convince ourselves that quantum mechanics should blur this un- 
wanted feature. Switching on an external field during a time At implies an un- 


Pe v(t) 


Figure 1-5 Preacceleration of a classical charge. 
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certainty in energy of order AE ~ h/At. If AE is comparable to mc?, we find 
At ~ h/mc? ~ to/« ~ 1377p. Classical acausal effects are therefore unobservable. 

We have glossed over the difficulties associated with radiative corrections. 
This should not prevent us, however, from applying these corrections under 
proper circumstances to study such ——— as line-width broadening, cor- 
rections to scattering, etc. 


NOTES 


For an account of classical mechanics and its application to field theory see 
L. D. Landau and E. M. Lifshitz, “Mechanics” and “The Classical Theory of 
Fields,” Pergamon Press, Oxford, 1969 and 1975. A deep treatment of mechanics 
is given, for instance, in E. Cartan, “Legons sur les Invariants Integraux,” Hermann, 
Paris, 1958. Classical electromagnetism and radiation is thoroughly discussed 
in J. D. Jackson, “Classical Electrodynamics,” John Wiley, New York, 1975. 
Consistency problems of electrodynamics are studied in F. T. Rohrlich, “Classical 
Charged Particles,” Addison-Wesley, Reading, Mass., 1965. 
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CHAPTER 


TWO 
THE DIRAC EQUATION 


To prepare the construction of a fully relativistic quantum mechanics, we follow 
the historical path by attempting to build a one-particle theory. We introduce 
the Klein-Gordon and Dirac equations and discuss their limitations. Applications 
involve the electromagnetic interactions, the relativistic hydrogen spectrum, and 
Coulomb scattering. The hole interpretation of negative energy states in terms of 
antiparticles requires a reformulation, the so-called second quantization, as a 
many-particle theory. 


2-1 TOWARD A RELATIVISTIC WAVE EQUATION 


2-1-1 Quantum Mechanics and Relativity 


Our first aim will be to try to accommodate the principles of quantum mechanics 
and of relativistic invariance, namely, to construct a Lorentz covariant wave 
equation. Along the way, we shall encounter increasing difficulties and incon- 
sistencies, which ultimately will force us to a complete recasting of our physical 
concepts. 

In quantum mechanics, the states of a system are represented by normalized 
vectors |y> (or density matrices p =X p;|W;><yil) of a Hilbert space #’: Kell? 
(or <g|p|g>) is the probability of finding the system in the state |p>. Physical 
observables are identified with self-adjoint, A = A‘ (but generally unbounded), 
operators on the space ¥. The expectation value of the observable A when the 
system is in the state ly, that is, the average value for many measurements on 
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identically prepared states, is <y| A|W >. The time evolution of the system under its 
self-interaction, or under external forces represented by classical given force fields, 
is described by the Schrodinger equation: 


ine ly(t)> = H|p(t)> (2-1) 
or, equivalently, 


\W(t2)> = U(te, t1)|W(t1)> 


H is self-adjoint and U is unitary and satisfies 
0 
ih aa U(ta, t1) = H(t2) U(t2, t1) 
C2 


It frequently occurs that a system is invariant under certain symmetries, for 
instance, symmetries of the external forces. A theorem of Wigner then states that 
such symmetries are represented by unitary (or antiunitaryt) operators, which 
map the Hilbert space onto itself, conserve the modulus of scalar products, and 
commute with H. 

On the other hand, special relativity states that the laws of nature are inde- 
pendent of the observer’s frame if it belongs to the class of frames—the “galilean 
frames”—obtained from each other by transformations of the Poincaré group. 
The latter is generated by the space and time translations, the usual space 
rotations, and the special Lorentz transformations (or boosts), which relate frames 
moving with constant relative velocity (see Chap. 1). The speed of light c is an 
absolute upper bound on the velocity of any signal. Information originating from 
the space-time point (Xo, to) reaches only points (x,, f,) inside the future cone 


ert, = to)? = (xX; = Xo)- = 0 5) — ily 0 


This is the relativistic expression of causality. For typical velocities much smaller 
than c, galilean mechanics is a reliable approximation. 

We may expect some trouble in the search for a relativistic and quantum 
description of a point particle. Indeed, relativity associates a momentum scale 
p= mc to a particle of mass m. But the uncertainty relations Ax: Ap ~ hi tell us 
that for length scales smaller than the Compton wavelength % = h/mc (4 = 3.8 x 
10~** cm for the electron), the concept of a point particle may suffer difficulties. 
Analyzing the position of the particle with a greater accuracy requires an energy 
momentum of the same order as the rest mass, thus allowing the creation of new 
particles. We see that we shall unavoidably be led to the concept of antiparticle. 
Nevertheless, in an intermediate range quantum relativistic mechanics is worth 
while and justifies the following developments. 


t An operator B is said to be antilinear if 
B(A|@> + p|W>) = A*Bl@> + u*Bly> 


Defining its adjoint Bt by <By|W> = <@|Bty>* = <B'y|o> B is antiunitary if <Bo|By) = 
<g|B'B\y> = <oly>* = Cyl». 
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To combine relativistic invariance with quantum mechanics let us return to 
the correspondence principle. In the usual configuration space representation of 
quantum mechanics, we associate the operators ih(0/ét) and (h/i) V; = (h/i)(0/éx') 
to the energy E and momentum p’ respectively. For a free massive particle, the 
energy is given in terms of the momentum by 


2 


a a + constant in the nonrelativistic picture (2-2a) 
E? = p*c? + m’c* in the relativistic case -(2-2b) 


Unless explicitly specified, we shall use the convenient system of units such that 
iow 

In the same way that the correspondence principle transforms Eq. (2-2a) 
into the Schrédinger equation for the wave function (x, t) = <x, t]W>: 


GG vy 
in ¥& t) = — W(x, t) 
it leads, in the relativistic case, from Eq. (2-2b) to the Klein-Gordon equation: 


G7 
(S — V? — n) Ux) 0) (2-3) 


Although this equation does not have the Schrédinger-like form (2-1), we may remedy this by casting 
it in a matrix form. Introducing the notations 


ay oy 


y-,=myp Yo=— ier: Bales 


the vector y = {Wu} (« = —1,0,..., 3) satisfies 


ee 
i = (mp +20 wy 


~ 


for a suitable set of 5 x 5 hermitian matrices. The reader may explicitly write this set of matrices and 
obtain an auxiliary condition in order to reproduce a set of equations equivalent to Eq. (2-3). 


If we want to interpret y as a wave function, we have to find a nonnegative 
norm, conserved by’the time evolution. There does indeed exist a continuity 
equation: 


Pe + div j = 0,j" =0 (2-4) 


where the four-vector j“ = (j° = p, j') is defined as 
me fi ? , ) 
P= Om (v ot ot v 


ae 
ain Vy — (Vy)*] 


(2-5) 
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In integral form we have equivalently 


7) : —_ 
-2{ 4 xp=| as j 


which expresses that the change in the total “charge” inside the volume V cor- 
responds to the flux of j through the surface S enclosing V. However, the density 
p is not positive definite. Therefore, it may well be considered as the density of 
a conserved quantity (the electric charge, for instance), but not as a positive 
probability. 

A second problem arises when we realize the existence of negative energy 
solutions. Any plane wave function 


W(x, t) = NeW HEte) 


satisfies Eq. (2-3), provided E? = p* + m?. Thus negative energies E = —./p? +m? 
are on the same footing as the physical ones E = ./p? + m’. This is a severe 
difficulty because the spectrum is no longer bounded from below. It seems that 
an arbitrarily large amount of energy may be extracted from the system. For a 
particle initially at rest, this will be the case if an external perturbation allows it 
to jump over the energy gap AE = 2m between the positive and negative con- 
tinuum of states. This is clearly a failure of the concept of stable stationary states. 
These reasons seemed at a time so overwhelming that they led Dirac to 
introduce another equation. Although the latter has a positive norm, we shall 
ultimately have to face the same problems of physical interpretation of the 
negative energy states. At that stage, we shall come back to the Klein-Gordon 
equation and recast our relativistic quantum mechanics as a many-body theory, 
where the negative energy states may be interpreted in terms of antiparticles. 


2-1-2 The Dirac Equation 


Since the Klein-Gordon equation was found physically unsatisfactory, we shall try 
to construct a wave equation 


En (a V+ fm) y= Hy (2-6) 


where yw is a vector wave function and «, f are hermitian matrices to make 
H hermitian, such that a positive conserved probability density exists. We now 
insist on the three following points: 


1. The components of y must satisfy the Klein-Gordon equation, so that a plane 
wave with E? = p” + m’ is a solution. 

2. There exists a four-vector current density which is conserved and whose fourth 
component is a positive density. 

3. The components of y do not have to satisfy any auxiliary condition, namely, 
at a given time they are independent functions of x. We shall also have to 
verify the relativistic covariance of this formalism. 
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Dirac proposed that the a; and f matrices be anticommuting matrices of 
square equal to one: 


{a,%}=0  forit¢k 
{a;, B} = 0 (2-7) 
a? = p2 =] 
with the bracket { A, B\ of two operators standing for the symmetric combination 


AB + BA, called the anticommutator. 
It is easy to verify that condition 1 is fulfilled: 


a? 1 2 
das Ns Hd 
sal" taht ib 
Let us introduce the notation y": 
°= B 
7 bo j= aes (2-8) 


(yy = 2gh 
and the Feynman “slash”: 


f= a,y" 
This enables us to rewrite the Dirac equation as 
(iy* 0, — m)p = (id — my = 0 (2-9) 


The Klein-Gordon equation is then obtained by multiplying by (i@ + m). Four 
is the smallest dimension in which matrices fulfilling (2-8) can be found. 


: ee i ‘ee mn 
The matrices x' and f have eigenvalues equal to +1; for i # j, det a'a/ = det — o’a' = f 1) det ort! 
thus their dimension d must be even. Since for d = 2 there exist only three anticommuting hermitian 
matrices, the Pauli matrices, we have d > 4. 


An explicit representation is provided by 


oer OO i -.) 
= mst? 


(2-10) 


in terms of the 2 x 2 unit J and Pauli a‘ matrices. This representation is useful 
when discussing the nonrelativistic limit of the Dirac equation. 


: Seyao -1 
Amongall possible equivalent representations, obtained by a nonsingular transformation : y— Uy U ‘ 
the Majorana representation plays a special role. It is designed so as to make the Dirac equation 
real. This is achieved by interchanging «) and f and changing the sign of a and «3 in the previous 
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representation: @, = —01,02 = B, a3 = —a3, B = 4. Then only f is imaginary and the Dirac equation: 
(esevim)ves 
— Fav + ipm |e 
ot 


is real: its solutions are linear combinations of real solutions. The matrix U which performs this 
change of representation and the new form of the y matrices may be easily determined (see the 
Appendix). 


In the four-dimensional representation (2-10), ” may be written as a bispinor 


y= a in terms of two-component spinors ¢ and . For reasons that will soon 
x 


be clear, g and y are referred to as the large and small components respectively. 
They satisfy 


(2-11) 


Itis interesting to notice the similarity between these equations and two of the four Maxwell equations: 


ra) é 
rane ead tbe ed 
Ot ot 


or, explicitly, 


Fe a(iB) 1 
[== -S-¥08) 2 s+ V 
i= =>S8-V(B) i——=—S-V(®) 


where (S') x = (1/i)eijn- 
The spin matrices S' play for the spin 1 electromagnetic field the same role as the Pauli matrices 
o for the spin 4 and (E, iB) is analogous to (¢, ). 


The main reason for the construction of the Dirac equation was to obtain a 
positive probability density p = j°, together with a continuity equation 0,j* = 0. 
Since y is a complex spinor, p has to be of the form w'#y in order to be real 
and positive. Let us first derive the Dirac equation for W'. From (2-9), we deduce 


W'n'* d, +m) = 0 
But y“' is easily expressed in terms of y": 
pr=y y= (Ba)! = aB = B(Ba)B = yyy 
Thus, introducing : 
y= yty? (2-12) 
we have 


VG + m)=0 (2-9a) 
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Combining Eqs. (2-9) and (2-9a) leads to 
Wd + dW = 0,(by"W) = 


We have, therefore, a candidate for the current 
jt =i} =p= Ww =v = glo t xx 
j= Wry = vray = gtoy + y'a9 
The density p is positive. Small and large components contribute equally to 


p whereas j involves cross terms. We shall see below that j* transforms as a 
Lorentz four-vector. 


(2-13) 


2-1-3 Relativistic Covariance 


According to the relativity principle, we want to verify that the Dirac equation 
keeps its form in two frames related by a Poincaré transformation. Alternatively, 
we require that to a system satisfying the equation with certain boundary con- 
ditions in a given frame we may associate by Poincaré transformations a family 
of transformed states satisfying the same equation, with transformed boundary 
conditions. 

Sticking to the first point of view (independence with respect to the observer), 
we first remark that translation invariance is obvious. Consider now a Lorentz 
transformation A. Let our system be described by the wave function y in the 
first frame and by w’ in the transformed frame. Both must satisfy the Dirac 
equation: 


iy® ae W(x) — mp(x) = 0 (2-9b) 
iy" ax ‘(x’) — my'(x’) = 0 with x’ = Ax (2-9c) 


There must be a local relation between w and yw’, so that the observer in the 
second frame may reconstruct ’ when w is given. We assume that this relation 
is linear: 


W(x’) = S(A)W(x) (2-14) 


where S(A) is a nonsingular 4 x 4 matrix. Equation (2-9c) now reads 


ipt 2 © Saya) — mS(AWC) = 


In order that this equation be a consequence of (2-9b) for any y, and since 
dx” /dx'" = (A~*)”,, we must have 


S(A)y"S~ *(A) = (A7*)v7" (2-15) 


Let us first construct S(A) for an infinitesimal proper transformation A, which 
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may be written as 
Magi tot (A =gt ot +" 


where the infinitesimal matrix w,, is antisymmetric. We write 


S(A) = P= Zope" + °° 
(2-16) 


S“'A)= I+ Zona" +--- 


where the matrices o,, are antisymmetric in pv. To first order in w, Eq. (2-15) 
yields 


[y", Cap] = 2i(g"ayp — 9"s'Va) (2-17) 


A set of matrices o,, satisfying this relation is given by 


l 
Tap = 5 [ye Ye] (2-18) 


A finite transformation is of the form 
S(A) = e~ W400" (2-19) 


where w** is now finite. 
For spatial rotations S is unitary, whereas it is hermitian for Lorentz boosts. 


The form of the finite transformations is most easily derived in the chiral representation for 


y matrices : 
0 -!I o 0 O06 
(oy — = = = 
ae * 0) ‘ @ ) ' a ) 


= 5 [7] = a= 7(¢ al 2-20 
oes ith t= tae 2 Nae, (2-20) 


i i oO, 0 
C= 2 [y: 7A | = ~5 [a:, a;] = Ei jx ie | 


In this representation, the two Pauli spinors of the decomposition of the bispinor w transform 
independently under rotations and boosts. The representation of the Lorentz group [more exactly, 
of its covering group SL(2,C)] is reducible into a sum of two inequivalent representations: (4, 0) + 
(0, 5). However, we shall see that the representation is irreducible if we include the transformation 
under parity (space reflection). 


We recall that the representations of the Poincaré group are classified accord- 
ing to the values of two Casimir operators P? and W?; P,, is the momentum 
energy operator, which is the infinitesimal generator of translations, whereas 
W,, is constructed from the angular momentum operator J,,, the infinitesimal 
generator of Lorentz transformations, as 


W,, = = Seaver P? (2-21) 
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If M* denotes the eigenvalue of P?, W? takes only values of the form 
W? = —M2S(S +1) 


where the spin S is integer or half integer. 
For the solutions of the Dirac equation, and therefore of the Klein-Gordon 
equation, P* = —d? takes the value m?, while J,,, is given by 


¥0)= (1-5 Jno" 49 
= ( = ; rat” W(x? — w?,x’) 


= (: - 3 Cnyvo"” + x,0"0,) Ut) 
which yields 
Juv = FOny + i(x, 8, — xy 0,) (2-22) 


Let us then compute W, from Eq. (2-21): 


1 
Wi ioe ae Euvpo ao" (2-21a) 


The orbital contribution has disappeared, justifying that W, corresponds to 
intrinsic angular momentum. We then use the identity 


Euapy ony pty’ = —det (Gzz') 
= oe (—1)” JaP, BP yIyP,, (2-23) 


I 


where in the first expression t (or t’ respectively) takes the values «, B, y (a’, B’, y’), 
and in the second one, the sum runs over the permutations P of (a’, f’, y’). After 
some algebra using the Dirac equation, this leads to 


W? = 3m? = —46 + 1m? (2-24) 


Thus the equation describes spin 4 particles. 
Finally, we derive the transformation law of the spinor w under parity. We 
have again to find S(A) satisfying (2-15), where A denotes the matrix 


yee (2-25) 


It is easy to see that 


W(x’) = ney W(x) (2-26) 
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is the desired transformation. Here yp is an arbitrary, unobservable phase. The 
important point is that the positive and negative energy solutions have relative 
opposite parities corresponding to the two opposite eigenvalues of 9°. After tite 
reinterpretation of negative energy solutions, this will mean opposite intrinsic 
parities for particle and antiparticle. , 

The various bilinear forms constructed from wy and w play an important 
role in the sequel. The remainder of this section is devoted to the study of their 
transformation properties under Lorentz transformations. From Eq. (2-14), we 
deduce that 


W(x) = WOody°S(Ay'y° 
= W(x)S~ (A) 


where the second expression is verified using the explicit expressions (2-19) of 
S(A) [and (2-26) for parity]. Thus a bilinear product w(x) Aw(x) transforms accord- 
ing to 


W(x) Aly'(x’) = W(x)S (A) AS(A) W(x) | (2-27) 
For instance, from (2-15), we learn that wy“ transforms as a four-vector: 
W'(x’Vy"b'(x') = At Wy” W(x) (2-28) 


whereas wW(x)wW(x) is a (nonpositive definite) scalar density. 

More generally, any 4 x 4 matrix may be expanded on a basis of 16 matrices. 
It may be shown that the algebra generated by the y matrices—a Clifford algebra 
for mathematicians—is nothing but the complete algebra of these 4 x 4 matrices. 
Let us introduce the notation{ 


yey = iy yy (2-29) 
In the representation (2-10) 
y= ( s) (2-30) 
The matrix y° satisfies 
iyiyvf}=0 and (pPy=l (2-31) 
We now consider the 16 matrices: 
= 
Mey 
PP yy = Ow = 5 [90] 
= 
R= 


t A consensus has not been reached on this notation. Some authors do not introduce the factor i 
(then y3 = —1), or use a different sign. 
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They have the following properties : 


1. (0%? = +1. 
2. Fomansiie(iee ge — 1) asthexeiexistea Such that rr? =.— Ter. 
3. Thus the trace of all T'*, except IS, vanishes: 


td’)? = —tr rer? = —triy*T* = 0 


4. For any couple (I'*, I’), a# b, there exists T° # > = J such that °F? =T° 
up to a factor +1 or +i. 


5. From these properties, we deduce the linear independence of our set {I}. 
Suppose that 


Yaak = 0 
Multiplying successively by all the T° and taking the trace leads to 1, =0 


for all a. 
6. We note the following identities : 


"Yu = 4 ov 2) 
Py y= 4h yyy Py yy = — 27" y?y" 
These and other useful identities have been listed in the Appendix. 


(2-32) 


Using this basis, we now give the properties of the corresponding bilinears 
w Aw under proper Lorentz or parity transformations: 


S: W(x)0'(x’) = Wo WO) scalar 

V W'(x'y W(x’) = AX W(x)y" W(x) vector 

di W(x')o""'W'(x') = AXA’ W(x)o?? w(x) antisymmetric tensor 
A: W(x’)ysy"W'(x’) = det (A)A“,(x)ysy"W(x) - pseudovector 

ie W'(x’)ysW'(x’) = det (A) W(x)ys W(x) pseudoscalar 


(2-33) 


The prefix “pseudo” refers to parity and x’ stands for x'* = A“,x". 


2-2 PHYSICAL CONTENT 


2-2-1 Plane Wave Solutions and Projectors 


We seek plane wave solutions of the free Dirac equation (2-9), 1. solutions 
of the form 


W(x) = e"*"* u(k) _ positive energy an 
W(x) = e*'*v(k) — negative energy 
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with the condition that k° is positive. To verify the Klein-Gordon equation, 
we also must have k? = m?. The positive time-like four-vector k” is nothing but 
the energy momentum of the particle (with h set equal to one). The Dirac equation 
implies 
(k — m)u(k) = 0 (K + m)v(k) = 0 (2-35) 

Let us assume that the particle is massive, m 4 0. In the rest frame of the particle, 
k* = (m, 0), and Eqs. (2-35) reduce to 

(y° — 1)u(m, 0) = 0 

(y° + 1)v(m, 0) = 0 
There are clearly two linearly independent u solutions, and two v’s. In the usual 
representation (2-10), we denote them as follows: 


1 0 0 0 
um, 0) = u?)(m, 0) = v(m, 0) = v(m, 0) = 
i 0 ; 0 ' 1 : 0 
0 — \o 0 
(2-36) 


We could now boost these solutions from rest up to a velocity v = |k|/k° by a 
pure Lorentz transformation, using Eq. (2-19). It is simpler to observe that 


(k — m)(k + m) = k? — m* =0 


E+m\'2 
( Aa ) om, 0) 


so that we may write 


k+m 
)(k) = ——____._ y(m, 0) = 
ee Soe) ee o-k A 
[amin + Eye 9) 
— (2-37) 
o 1/2 1(m, 0) 
v(k) = Lae v(m, 0) = [2m(m af E)| 
/2m(m + E) eee ee 
( 2m ) yo Un, 0) 


Here E denotes the positive quantity: E = k° = (k* + m?)"/?;; the two component 
spinors gy and y are the nonvanishing components of u(m, 0) and v(m, 0) respectively. 
For the conjugate spinors we find 


eatin. 
./ 2m(m + E) 
een 


,/2m(m + E) 


u(k) = u(m, 0) 


(2-38) 
o(k) = 0(m, 0) 
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The normalization factors have been chosen in order that 


u(k)u(k) ae Sap u™(k) vk) = 
2-39 
D™(k)v(k) = — dap v(k)u(k) = 0 on) 
Consider now the matrices 
Ak = Yo u(y @ 109 
a=1,2 
eee ee eee 
~ Imm +B) 
ene (2-40) 
2m 
where use has been made of the identity valid for k? = m?: 
(k + m)y°(k + m) = 2E(k + m) 
Similarly, let 
A-=-— ¥ vk) @0() 
a=1,2 
Se py —@ — m 
~ 2m(m + E) 
eta (2-41) 
2m 


The operators A, and A_ project over the positive and negative energy states 
respectively. They satisfy 


A4(k) = Ax(k) 
tr A+(k) = 2 (2-42) 
Ai(k) + A-(kK) =1 
The normalization in (2-39) is Lorentz invariant. However, the positive definite 


density per unit volume is p = j°(k) = W(k)y°W(k). Let us compute it for our plane 
wave functions 


YP 7 WOM) = (Rp uM(K) 


es i(k) ie u)(k) 


elles | (2-43a) 
m 
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for positive energy solutions, and 
YO (x) yy (x) = Dd (k)y°v(k) 


oe (2-43b) 


for negative ones. The spinors have been normalized at rest; since density times 
volume has to remain constant, when the latter is reduced by the contraction 
factor E/m the former must increase by the same amount. 

Positive and negative energy states are mutually orthogonal, if we consider 
states with opposite energies but the same three-momentum : 


p(x) =e ek ®) u(k) 
yx) = eilkx°+k-x) vk) with k = (k°, =k) 


95 
PIP xy (x) = em 2ikex® D)(k)you(k) (2 44) 


Eat —2ik°x® (8) iE _k 0 of (a) = 
ze v ( at a0 (k) =0 


The physical meaning of negative energy solutions has yet to be clarified. 
Also, our construction of plane wave states does not make sense for zero-mass 
particles. Those will be studied in detail in Sec. 2-4-3. 

To characterize the remaining degeneracy of the plane wave solutions u and 
v, we construct the projectors onto states of definite polarization. For any space- 


like normalized four-vector n(n? = — 1) orthogonal to k, we have, from (2-21), 
W- nS teypet ho 
: (2-45) 
= —Zysnk 


Therefore, in the rest frame 


Ww 
ew) i aa 


: 0 
In the usual basis, » = (5 *) If we choose n along the z axis, n= N3) = 


(0,0, 0,1), we see that the solutions (2-36) are eigenstates of —W>+n,3)/m = W3/m, 
with eigenvalues + 3 (spin-up) for u and v, and —4 (spin-down) for u‘?) and 
v'?). The projector onto u'(m, 0) and v(m, 0) may thus be written 


I+ ysis) 1 I+063 0 
2 7a) 0 I-o3 


After a Lorentz transformation, the spinors u(k), v(k) are eigenstates of 
— W-n/m, where n is now the transform of na): 


P(n3)) = 
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men 


— 


——* y(k) = 57s fu(R) = + 5 uk) 
w. £ (2-46) 
— (k= — 5 yst(k) = + 5 ¥(k) 


The plus sign refers to x= 1, the minus sign to «= 2. The projector onto 
u\)(k) and vk) reads 


P(n) = 3(I + ysh) (2-47) 
This expression remains valid for an arbitrary normalized vector n, orthogonal 
to k; P(n) projects onto the state which in its rest frame has a spin o-n/2 = 4 
for a positive energy solution, and a spin o: n/2 = —4 for a negative energy one. 
(Note the signs!!) 


We shall denote u(p, n) and v(p,n) the (positive and negative energy respec- 
tively) eigenvectors of P(n): 


P(n)u(k, n) = u(k, n) 


(2-48) 
P(n)v(k, n) = v(k, n) 


The projector P(n) has the following properties: 
[A+(k), P(n)] =0 
A+(k)P(n) + A_(k) P(n) + A+(k)P(—n) + A_-(k)P(—n) =1 
tr As(k)P(+n)=1 
Relaxing the condition on the norm of n, 


p(n) = +h 2st =n = 0 


may be interpreted as a spin density matrix 


trp=2 trp?<4 


There exists a particular choice of n, such that n is proportional to k in the reference frame. Let n, 


be equal to 
mh = (E ae x) (2-49) 
m’ m |k{ 
This definition of polarization is called helicity, and is such that 
*k 
P(n) A+(k) = (1 aE att (2-50) 


Therefore P(m,) projects over positive helicity, positive energy and negative heticity, negative energy 
states. 
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In the ultrarelativistic limit m/ko > 0, |k|/ko + 1, nf + k*/m, and 


Lt ys em 


P(+n)A+(k) > 5 
(2-51) 
1 - War fa —k +m 


P(+m)A-(k) > 5 


2-2-2 Wave Packets 


Let us proceed to the construction of normalizable wave packets. We would like 
to superpose only positive energy plane waves, the only physically sensible ones 
at this stage. However, we shall be led to inconsistencies and will have to abandon 
this requirement. Let w‘*)(x) be 


WP) = \en2 OB be, HPs aut(p) em * (2-52) 
The factor [1/(27)*](m/E) is designed to make the normalization condition simple: 
d°x 
Sas Kary} = Bee ee * 
[a (bax) |e [lever = P EE 2s b*(p, x)b(p’, x’) 
x u™*(p)yul@(p!) e(E-E)—itp—p)-x 


=) e = E oe, 0)? = 1 (2-53) 
(27) 


where we observe that d°p/E is a Lorentz invariant measure. 
We also compute the total current 


wx m? 
= | Hx jt, x) = \a | | dp ap“ D b*(p, 2)b(p',2!) 


"(pau (pl) eke Biases 


- (55 mie ys Fi ~ b*(p, a) b(p, a’)u (pau (p) 


We need here the Gordon identity which states that for any two positive 
energy solutions u(p) and u!”)(q) of the Dirac equation, we have 


1 
u(p)y" ug) = = u(p) [( (p aie q)" oe io*”(p _ q)v Ju(q) (2-54) 


Indeed, as a consequence of the Dirac equation 


0 = ap) [Ald — m) + (6 — maul) 
= amar aur + arr) SESH al [Paani 
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where a is an arbitrary four-vector. Equation (2-54) follows by differentiation 
with respect to a,. Using this identity together with Eq. (2-39), we may write 


' a? 
say ep lanal z= () os5 


Therefore the total current for a superposition of positive energy solutions is 
just the group velocity. This is analogous to what happens in the Schrédinger 
theory, and seems satisfactory. However, there is an inconsistency in the assump- 
tion of superposition of positive energy solutions only. 
To illustrate this point, let us consider the time evolution of a wave packet 
given at time t = 0 by a gaussian distribution of half width d: 
1 


W(0, x) = Gd eWX fd w (2-56) 


where w is some fixed spinor, say i 


The corresponding (normalizable) solution of the Dirac equation has the 
form 


dp m . Su 
— | sors eB au (py em #* + dP(p, ao) e”*] (257) 


Since the Fourier transform of a gaussian is a gaussian 
jax e-* Ad —ie-x — (2nd7)3!? e—PeP 
we may write 
(4nd?) e-PPR w= [blp, a)u(p) + d*(6, a(R) 


where p stands for (p°, —p). 
From the orthogonality relations (2-43), it follows that 


b(p, ot) = (4nd?)?/* e~ P°#*2 yt (pyw 


(2-58) 
d*(p, a) = (4nd?)?/* e~ P12 yt (pyw 
Using the explicit expressions (2-37), we see that the ratio b/d* is typically of 
order |p|/(m + E) and becomes important when |p| ~ m. If the wave packet is 
spread out over a distance d > 1/m, the contribution of momenta |p| ~m> 1/d 
is heavily suppressed, and the negative energy components are negligible ; the 
one-particle theory is then consistent. However, if we want to localize the wave 
packet in a region of space of the same size as the Compton wavelength, that is, 
d <1/m, negative energy solutions play an appreciable role. This quantitative 
discussion is in agreement with the heuristic arguments presented at the beginning 
of this chapter. For a wave packet with negative energy contributions as in (2-57), 
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we compute as above the normalization condition 


3 
| a  L(lb(e, a)|? + |d(p, 2)|?) = 4 (2-59) 


and, with the notation p = (p°, — p),.the total current is 


i d*p m {p' 2 casi 
J (t) = |e am 1) [| b(p, BI Se |d(p, x) | 


at 2 [b*(p, a) d*(p, at’) e2ikt a (p)o vp) 
— b@®, x) d(p, a’) (noun (2-60) 


It is now time dependent. Besides the group velocity term, there is a real, oscil- 
lating term. The frequency of these oscillations is very high—larger than 


& 
2m>-~2 10s Ss = 


This phenomenon, traditionally called zitterbewegung. is an example of the 
difficulties due to the negative energy states in the framework of a one-particle 
theory. 


A more striking manifestation is the famous Klein paradox. Let us idealize the localization process 
by a square potential barrier of height V in the half space : = x? > 0 (Fig. 2-1). Consider now in 
the z < 0 half space an incident positive energy plane wave of momentum k > 0 along the = axis: 


1 


0 
Wine(z) = e** k (spin-up along the z axis) 
E+m 
0 / 
The reflected wave has the form 
1 0 
0 1 
—k F 0 
zj=a e7 tkz ae b —ikz 
Vret( ) —) é k 
0 E+m 


(superposition of spin-up and spin-down positive energy solutions). In the z > 0 half space, ile., in 
the presence of the constant potential V, the transmitted wave has a similar form: 


1 0 
0 1 
ine q ; 0 
= (24 ee Sigs) 
Wtrans(2) ce Ear +de a 


THE DIRAC EQUATION 63 


Figure 2-1 Klein’s paradox in a 
square potential. 


with an effective momentum gq of 


q = [(E — V)? — my"? 
Writing down the continuity of the solution at z = 0, 
W(z) = 0(—z) [Winc(z) 4 Wret(Z)] + 0(2)Werans(Z) 
determines the coefficients a,...,d: 


pa (no spin-flip) 


q E+m 
{—a=re wherer=— ———— 
k E-V+m 
As long as |E — V| <m, q is imaginary and the transmitted wave decays exponentially; beyond a 
few Compton wavelengths, it is negligible. If we increase V so as to restrict this penetration region, 
the transmitted wave becomes oscillatory when V > E + m. 
The computation of the transmitted, reflected, and incident currents yields 


Jtrans ba 4r Jref _ (; zi “y ae i _ divans 
Jine (1 a r)? Jinc er 


Hine 
The conservation of probabilities does indeed look satisfied : 
Jinc = Jtrans ar Jet 


Unfortunately, since r < 0, the reflected flux is larger than the incident one! We are again in trouble 
when we try to localize the particle within a distance of the order of the Compton wavelength. 


In spite of these difficulties, the Dirac equation and its one-particle inter- 
pretation are very useful and physically sensible as long as we consider external 
forces which are slowly varying on a scale of a few Compton wavelengths. They 
provide us with the first relativistic corrections to the Schrédinger picture. This 
is what we are going to explore at length in the next sections, before returning 
to a deeper investigation of the meaning of negative energy states. We now realize 
that the difficulties which led us to disregard the Klein-Gordon equation have not 
been really solved. Even though we shall pursue this discussion in the framework 
of the spin 4 theory because of its important physical implications, we could as 
well concern ourselves with the scalar case within the same range of validity. 
This is another instance where important physical theories were constructed for 
what seems afterwards to be unconvincing motivations. 
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2-2-3 Electromagnetic Coupling 


We wish now to study the interactions of a Dirac particle with an external 
(classical) electromagnetic field characterized by its potential A,(x). The relevant 
coupling is obtained from the free Dirac equation through the minimal coupling 
prescription described in Chap. 1: ° 


0,7 0, + ieA, (2-61) 
The Dirac equation then reads 
(ig — eA — m)p(x) = 0 (2-62) 


This prescription ensures the invariance of the equation under gauge transfor- 
mations: 


(x) > ef*# (x) 
Y ¥ (2-63) 
A,(x) > A,(x) — 8 ,0(x) 
Here e denotes the charge of the particle; it is negative e = —|e| for the electron. 


The Lorentz covariance of this equation is clear. If we change our reference 
2 = : 
frame, the electromagnetic potential transforms as a vector 


A(x! = Ax) = (A~*),” A(x) 


and therefore the analysis of Sec. 2-1-3 may be extended to the present case. 
Equation (2-62) can be rewritten more explicitly as 


ju [a-(Fv— ea) + fm + ea? |y 
= (a'p+ Bm) + (—ea-A + eA°)h 
= (Ho + Ain) W (2-64) 


We note a strong resemblance of the interaction part H,,, with the hamiltonian 
of a classical particle in an external field —ev- A + eA°, in agreement with the 


interpretation of « as a velocity operator. In the Heisenberg representation, an 
operator 0 satisfies the equation of motion 


d 00 
a 00 = iLH, 0] + 


Thus, here, the position operator r and the gauge-invariant momentum z = p- 
eA satisfy 


ar. . 
i [dir] =e 

2-65 
ins) 0 Dn “ 
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with 
OA 
ES = -— ve 
ot : 
B=curlA 


The second equation is the operator version of the Lorentz force equation. In 
view of the paradoxes encountered in the preceding subsection, the interpretation 
of r and 7 as the position and momentum is, however, limited. 

To study the physical implications of these equations, we consider their non- 


eccercamt taal : ; I 0 
relativistic limit. We write y = (") and use the representation ~ = (; i 
Dé _ 


o = @ ‘ Equation (2-64) leads to 


0 
i= =a-ny + Ag +mo 


0 
iA =o mp + eA°y — my 


In the nonrelativistic limit, the large energy m is the driving term in (2-66). 
We introduce the slowly varying functions of time ® and X: 


gp=e ™® 


(2-66) 


. (2-67) 
y = mG 
These spinors satisfy 
i = ao:nX + eA 
(2-68) 
i= = ao: n® + eA°X — 2mX 
If we assume eA° «2m the second equation is solved approximately as 
X~—"0«o 
2m 
and the first one is the Pauli equation 
ey?) 
2 (C8 5 24° [0 (2-69) 
ot 2m 


This justifies the use of the terms large and small components for gy and x (or ® 
and X respectively). As for Eq. (2-69), it is a generalization to spinors of the 
Schrédinger equation in an electromagnetic field. After simple algebraic manipu- 
lations, 

(6+ x)? = oj0;n'n) = n? + 4[o,,0;][n', r] = 2 — eo 'B 
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we may rewrite it as 

—o°Bt eA |o (2-70) 
m 


The only spin dependence is through the magnetic interaction o° B. Restoring 
the factors h and c, 


H ——o°'B=-—yn-B (2-71) 


magn 


where the magnetic moment yp is defined as 
e he e 
y= "me 2 ‘ (=) Cr 


The spin operator S is ho/2. According to the definition of Sec. 1-1-3, the gyro- 
magnetic ratio g is 2. This is a nontrivial prediction of the Dirac theory, derived 
within the nonrelativistic context of the Pauli equation. 

The actual value of |u| equals 


~ eh 
lu] = — = 5.79 x 10-2 eV/G 


Radiative corrections affect the experimentally measured value of g by a tiny 
amount, which we shall study later. 


The Pauli equation (2-70) may be further reduced, if we consider a uniform magnetic field B = 
curl A, with the choice A = 4B x r, and neglect the quadratic term in A (weak field approximation). 


We obtain 
<n oe 
i—= 5 (L+28)-B ® 
21 


2m 


where L =r x p is the orbital angular momentum operator. The reader is invited to derive a com- 
plete set of solutions. 


The previous study may, in fact, also be performed on the quadratic form 
of the Dirac equation, 1e., without assuming the nonrelativistic approximation. 


Starting from (2-62), we multiply it by the operator (i? — eA + m). This yields 


[(id — eA — m*]p = Jo = (2) ae x o™[i0, — eA,, id, — eA,) — nly 


= a — eA) — o” Fiy—m ly= 0 (2-73) 


Therefore the spin-dependent term reads 


ea” Cu 
G5 5 Fw=—9 > (ia-E+o°B) 
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in the usual representation, and again corresponds to the value g = 2 for the 
gyromagnetic ratio. Notice that this value is a consequence of the minimal pre- 
scription (2-61). We could have written a nonminimal equation: 


: Ag e 0 = 
|e —eAy—ma 8 ae Fn [b= 0 (2-74) 


which would lead to g = 2 + Ag. Such an equation has to be used to study 
the behavior in weak fields of particles with g factors very different from 2. 


It is interesting to determine the energy levels in a uniform magnetic field. Assume the field B along 
the z axis. The potential vector .4 may be chosen such that A° = A* = A? =0, A” = Bx. For a 


stationary solution of energy E, y = e~'# (a) Eq. (2-66) reads 
ng 


(E — m)p = a: (p— eA)x 
(E+ m)y = o°(p—eA)p 
Eliminating y yields an equation for g: 
(E* — m?)p = [o- (p — eA)]? 9 = [(p — eA)? — eo Blo 
= [p? + eB? x? — eB(a, + 2xp,)|@ 


This is the hamiltonian of an harmonic oscillator. Since p,, p., and o, commute with the right- 
hand side, we seek solutions of the form 


(x) = ef? + P22) f(x) 


where f(x) satisfies 


2 
| : + (eBx — py)’ — eBe. |f (x) = (E” — m? — pz) f(x) (2-75) 


“ie 
Assuming the sign of B such that eB > 0, we introduce the auxiliary variables 
a Py 
= B a 
$ ve (x 2) 
Elam ~ 2 
eB 


a= 


thereby reducing (2-75) to 


(-ate v)f= af 


If f is an eigenvector of o. with eigenvalue «= +1, 


then f, satisfies 


(a- 2) £10) = —(at+a)f(2) a=+1 
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The solution which vanishes at infinity is expressed in terms of a Hermite polynomial H,(¢) 
fa= cel? HA) 

provided a + « = 2n + 1, n integer, n = 0, 1, 2,.... Therefore, the energy levels are given by 

E? = m? + p? + eB(2n+ 1— a) (2-76) 


and the corresponding wave functions may be easily written. These levels have both a discrete 
(n, «= —1;n+ 1, a= 1)and a continuous degeneracy (in p,). The latter may be reduced to a discrete 
one if we consider a particle in a finite box. Equation (2-76) gives a relativistic generalization of the 
Landau levels. The value g = 2 is such that the spectrum extends down to E? = m?. 


As a second example let us study the case of a Dirac particle in an electromagnetic plane wave, 
generalizing thereby the classical treatment of Chap. 1. The plane wave, which is supposed to be 
linearly polarized, is characterized by its propagation vector n, (n? = 0) and its polarization vector 
By (ce? = —1,e-n=0). We write A, =, f(€) where € =n-x and 6,A, = 7, A, with A\ =e, f'(¢). The 
quadratic equation (2-73) takes the form 


(—Ol — 2ieA +0 + e? A? — m? — ieXk A) =0 (2-77) 
Let us exhibit solutions of the form 


W p(x) = e7'?* wf) (2-78) 


with a Dirac spinor and p a four-vector which is not orthogonal to n. By adding to p some 
quantity of the form An, we may realize the condition 


p? =m? 


The interpretation of this four-vector is the following. In the frame where ¢° = 0, and thus A° = 0, 
E and A along Ox, B along Oy, and n along 0z, the operators p; = i@,, py = i€2, and i(@y + G3) = 
Po + p3 commute with the Dirac hamiltonian. The substitution of (2-78) into Eq. (2-77) leads to the 
condition 


2in > po'(6) + (e?A? — 2eA- p — iehA')p(2) = 0 


which is readily integrated as 


1/2 & SA 2 azper 
old) = (=) 5 ‘ HA i| ale A(E)"p AE ae oes) 
0 2n-p 0 n‘p 2n-p 


where u is a constant bispinor. Since (4.4)? = —n?A? = 0, we may write 


W(x) = (: os M4) (“)u el! 
2n-p 0 


where I stands for the action of a classical particle in a plane wave (damped at infinity), with 
P = mu(co) = p(co) [compare Eq. (1-84)]: 


e7 
a #6 


For if to satisfy the original Dirac equation and not only the squared equation (2-73), u has to obey 
some auxiliary condition. After some algebra, we find that 


1/2 
(id — eA — m)W,(x) = () (: ata Ee ilo — m)ue'! 


- -p-x-| | ae| = Ap — 
0 n‘p 2 


Therefore 


(p — m)u=0 
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and u = u(p) is a solution of the free Dirac equation. The reader will verify that the solution (2-79) 
has the correct normalization 


1 3 i 3 t 
On? dx W(t, x)w,(t, x) = d°(p — p’) 


and that the associated current reads 


, 7, Bu 1 “ u u aa A e* A” 
IN) = VolO7" Wold = = E aes (< np =| 


If A(é) is a quasiperiodic function of & (slowly damped at infinity), the average value of j” is 


showing the same phenomenon as its classical counterpart. These expressions were originally derived 
by Volkow in 1935. 


2-2-4 Foldy-Wouthuysen Transformation 


In the last subsections, the physical meaning of the Dirac equation has been 
investigated using a nonrelativistic approximation. It is worth while showing 
that this may be pursued in a systematic way. This is the purpose of the Foldy- 
Wouthuysen transformation. To be more explicit, we want to find a unitary 
transformation 


wae Sy’ 
which decouples the small and large components and where S may be time 
dependent. 
Let us call odd the operators such as « which couple large and small com- 


ponents, even those which do not (for example, J, B,...). Since w’ satisfies the 
equation 


id,’ = [e'S(H — id,) eS] = H'y' (2-80) 


our problem is to find S so as to get rid of the odd operators in H’ at a given 
order in 1/m. In practice, we shall do it up to terms of order (kinetic energy/m)° 
or (kinetic energy x potential energy/m7). This will lead to a further insight in 
relativistic corrections entailed by the Dirac equation. 

In the free case, we can construct S exactly. It is time independent and can 
be chosen as 


Sogo = —i 2 0 


|p| [P| 
Since (y- p/|p|)? = —J, we may compute e's and H' in a closed form: 
eS — efvP/|P)® — cos 9 + B ze sin 0 


Ip] 
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H' =e He * 
“= (co 6+ fp i sin i \ia-p + Bm) (cos 6-8 ia 6) 
=a: p(co 26 — Pl sin 28) + B(m cos 26 + |p| sin 20) 
The choice of 6 such that 


Sie ui cos 20 == 
eliminates the odd operator «: p and leaves 


H = (p? +m?) = BE = Blo? + py” 
as we would expect. In other words, we have decomposed H into a direct sum 


of two nonlocal hamiltonians +,/m? + p*. It is clear that these square roots 
cannot be represented in configuration space by a finite set of differential operators. 

In the interacting case, we therefore expect S to be of order m~', and we 
expand H’ to the desired order : 


H =H +i(S, #1) -515,[5, HM] - 215,515, #1 


+ sg (8. (5.(5.(5, 41M - $- 5188] + 205.05, 1) += 


Here use has been made of the general identity 


eB en 4 = DD S(ALAL 14 BI (2-81) 


where, in the generic term of the sum, A appears n times. This identity is easily derived by computing 
the successive derivatives of e*“B e~*“ at s = 0, and using them in the (formal) Taylor expansion at 
s=1. 


We start from H = fm+ © + & where © denotes the odd operator © = 
a*(p — eA) and & the even one eA®. The solution of the free case suggests that 


we take S$ = —if@/2 to first order. Then according to the above expression for 
H', we calculate 


co. 


. BO 
h C= [o,6]- 
a al ] prt 


o> of 1 
a © wos S-B)-as 
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o is now of order m~'. We then iterate the process. A second transformation 
e”'S’ with S’ = —i(B0'/2m), leads to 


H" = Bm + f ano" 
where ©” = O(m~*). Finally, a third step with S” = —(iB©"/2m) eliminates this 


odd term, leaving the desired hamiltonian 


(p—eA)  (p)* e 
He = —_ Oe ° 
B E + — +eA a Bo:B 


le e e 
+ | = ——320- curl E — ; maz di -82) . 
( mi o° cur ae o:Ex r) m2 div E (2-82) 
The interpretation of the various terms deserves some comments. The term in the 
bracket is the expansion (to the required order) of [(p — eA)? + m?]'/?. The 
second term eA°® is the electrostatic energy of a point-like charge, whereas the 
third one represents the energy of a magnetic dipole for g = 2. The term inside 
parentheses may be seen to correspond to a spin-orbit (s.o.) interaction. Indeed, 
for a static spherically symmetric potential, curl E = 0 and E = — VA°. Therefore 
dA° 1 dA° 


1 
o°(E x p)= me oD) = ran 


and the term in parentheses is 


e dA° 
panei nO 
Feo 4m?r dr” 

The magnetic field B’ = —v x E acting on the particle is responsible for this 
additional magnetic energy. Its interaction with the magnetic moment p (2-72) 
would give 

ae oR 


2m rr de oP) 


but due to the Thomas precession, this result is reduced by a factor 2. 

Finally the last term in Eq. (2-82), referred to as the Darwin term —(e/8m) 
div E, may be traced to the zitterbewegung. The electron position fluctuates by 
an amount ér such that <r?) ~ 1/m?, and its effective electrostatic energy is the 
average 


0? A%(r) 
J 


t 6rd piemere 
al ap <6r' dr! > 


<eA%r + dr) = eA) +5 
From spherical symmetry, this random fluctuation is 
ou 
3m? 


. éu 
<ér' 6r)> = a <or?> = 
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and the correction to eA%r) is 


div E 


e e 
0) — AA® = — 
eed) 6m? 6m? 

in good agreement with the sign and magnitude of the Darwin term. 

The reader will have noticed that as the Foldy-Wouthuysen transformation 
is time dependent, the expectation values of H’ in the state yw’ are, in general, 
different from those of H in the corresponding state . 


2-3 HYDROGEN-LIKE ATOMS 


An important application of the Dirac equation is the discussion of the fine 
structure of atomic spectra. This is a field notorious for the successes of quantum 
mechanics and its relativistic generalizations, including radiative corrections. 
In view of the intricacy of the Foldy-Wouthuysen method, it is quite remarkable 
that an exact solution of the Dirac equation in a static Coulomb field exists 
and leads to an excellent agreement with the observed results on hydrogen-like 
atoms. The difficulties discussed in Sec. 2-2 are not expected to play a significant 
role. In atomic physics, the relevant length scale, the Bohr radius ap = h/m,ca = 
0.53 A, is 137 times larger than the electron wavelength. For heavy atoms these 
difficulties do arise and other methods have to be developed. 


2-3-1 Nonrelativistic Versus Relativistic Spectrum 


Recall the nonrelativistic result obtained from the Schrodinger equation, a triumph 
for early quantum mechanics. The wave equation is 


A Ze" 
E a eae os Wr w(t) = : 
2 2 
where 1 ee ae 


Pn = K+ Ivar == 0,1... 


and m is the reduced mass of the electron nucleus system: 


m'=m,'+myi=m;! 
The quantization condition requires that n’ = n — (J + 1) must be a nonnegative 
integer (number of zeros of the wave function), and the energy levels are 


m(Za)? 
2n? 


a) = = 


n= 12 (2-83) 


Numerically, the Rydberg constant ma?/2 is equal to 13.6 eV. 
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The s-state (! = 0) wave function at the origin is 


3/2 
HO) = = (™*) (2-84) 


Since for a given n > 1, / can take integer values from 0 to n — 1, each level has a 
degeneracy equal to 54 “(21 + 1) = n?. This is related to a dynamical symmetry 
of the Coulomb problem according to a group O(4) of rotations in four-dimensional 
space, which was used by Pauli and Fock in the early days of quantum mechanics 
to give an algebraic derivation of the Balmer spectrum (2-83). This degeneracy 
disappears in the relativistic treatment leading to the fine splitting of the spectrum. 

Ignoring for a while the effects of spin, let us see the predictions of the 
Klein-Gordon theory when the electromagnetic coupling is introduced in a 
minimal way. Let E denote the total energy equal to the rest energy mc’ plus 


negative binding energy ¢«: 
2 
(e +2) +A-mi|p=0 


ro) fr a en A ee YA, DD | 
ies en (2 — yy? = (0 
Or? ior” r? r ( a) |? 


This equation is formally identical to the Schrédinger equation, after the sub- 
stitutions 


(2-85) 
or 


L? > L* — Z70? = AA + 1) 
> a — 
m 


eae 


E-= 
2m 


The orbital quantum number is shifted by 6,, | > 4 = | — 6,, where 
j= 1+4— [0+ $)? — 2202)” 

Zo" 

21+ 1 


and the principal quantum number n is similarly displaced by the same amount, 
since n’ = n — (1 + 1) must be an integer. Therefore, the energy levels are given by 
E2,—m? _ —mZ?o? EX 1 
2m 2 mm? (n— 6)” 


~~ 


+ O(a*) 


m 
a (2-86) 
og SL + [Z202/(n — 57] 
mZ2a2 mZ*a* 3 mZ*a* : 
yee ee lt + CO”) 
ie One 1)' 8 on ( 
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The second term is the nonrelativistic binding energy and the third one breaks 
the O(4) degeneracy. We are not going to discuss the pathologies of this case, 
namely, the singular behavior of the wave function at the origin due to the 
attractive term —(Z7?a7/r*) or the catastrophe which occurs when Z > 137/2 
(5, and thus E,, become complex!). Equation (2-86) is in poor agreement with 
the experimental situation, meaning that the effects of spin cannot be neglected. 


2-3-2 Dirac Theory 


We turn now to the predictions of the Dirac equation. Before constructing the 
wave functions, we first proceed to a simple derivation of the spectrum, as in the 
Klein-Gordon case. For this purpose, we square the equation, as in Eq. (2-73), 
and insert the nonvanishing component of the potential Ag = —(Ze/4zr), while 
Fo; = —Fio = —0;Ao = E*. It is convenient to work in the chiral representation 


; : 0 : 
(2-20) for the y matrices where o°®! is diagonal: o°! = 7 ‘a a Then the spin 


term in Eq. (2-73) reads 


a 


e 
5 oO Pw = tier? = F iZa— 


where 7 is the unit vector r/r. The analog of Eq. (2-84) is an equation for two- 
component spinors: 


ao 92 @ A eee . | 2268 . Z 
|- (+ S)+u — Za’ + iZao a : =—(E* 799) |= 0 


The total angular momentum J = L+S=L + o/2 commutes with the hamil- 
tonian and with L’. In the subspace where J? = j(j + 1), J; =m (j =4, Bes 
—j<m <J), and I’ = [(1+ 1), the integer | takes two values: |!=j+4= 1. 
and / = j — 4 =/_. Since o 7 has no diagonal matrix element, d+ leah =O) 
is hermitian and has a square equal to one, the operator [I? — Z*a? $ iZae- F| 
assumes in this subspace the following form: 


G+ aU + 3) — 270? FiZa 
+ iZo G —2)G + 4) — 270? 


[? — 2a? F iZaa-? = ( 


Let A(A + 1) be its eigenvalues 
A= [(j +4)? — 27 07)])/? and A=[(j +4)? — Z?07])/? ~ 1 
which may be written 


A=(f43)- 


' it 1 Z? a? 
th Ty 1 GEES | Ss gee One 44 
Wi oF ae (i+5) Lio ~ Fa + O(Z*«*) 


Again, n is also shifted by 6; so as to keep n' = (n — 6;))-—A—1 integer. The 
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condition n’ > 0 restricts j to the range j <n — 3 for A=j+4-—06,andj<n- 
5 for A = j — 4 — 6;. Therefore, there is a twofold degeneracy except for the state 
j=n-—+4. The final result is 
. m mZ? x? mZ*«* n 3 mZ*a* 

J1 + [Z707n = 5))7) 2n? (Qj +1) 8 


Ey 


(2-87) 


with n = 1,2,... and j = 4, 3,..., n — 4. A catastrophe occurs now for Z = 137; 
01/2 becomes imaginary beyond this value. 

The degenerate states may be distinguished by their orbital angular momen- 
tum /, which takes the values j + 3 (except for j = n — 4 where | = n — 1); this is 
in turn related to the transformation property of the state under parity. The 
energies of the low-lying states have been represented on Fig. 2-2, where we 
have used the customary, nonrelativistic spectroscopic notation nl;. 

The new phenomenon is the occurrence of the fine structure, i.e., the difference 
in energy between levels of different j, for the same value of n. Typically, for 
Z=1, 
mo 453 x 1075 eV 

ep en 


=O ONGTHIZ 


E(2P3/2) = E(2P1)2) = 


This fine splitting may be seen as a consequence of the spin-orbit coupling in (2-82): 


AE Za o*L 
~\ 4m? 73 


This term vanishes for an s wave, whereas for a p wave, L- o = (J? — L? — 7/4) takes the values 
1 and —2 for j = 3/2 and j = 1/2 respectively. On the other hand, the expectation value of 1/r? is, 
on dimensional grounds, of the form <nlj1/r?|nl> = kn(mZa)*, where kq is a pure number. Any 
textbook on quantum mechanics tells us that k,, = 82! + 1)n3[(21 + 1)? — 1] (= za for n= 2 and 
f= ie 


m(Za)* 


AE, ..(2P3;2 — 2P1)2) = 30 


in agreement with the previous estimate. (This holds also for higher values of n and 1.) 


Let us now construct the spinors which are the energy eigenstates of this 
problem. Returning to the Dirac representation (2-10), we write the bispinors 


y= bay and look for two-component spinors which are eigenstates of J Oe 
x f 
and 1? with eigenvalues j(j +1), m, and I(J + 1) respectively. Let gj,’ be the 
eigenstate for j = 1 + 3. 
As eigenstates of L*, p‘;) must have the form 


Oy Yh ; 
Pm = (a ) (no summation over p and LW’) 
u’ ; 
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where we use the standard notation Y,“ for spherical harmonics. They must also 
be eigenstates of J, = L, +S, with eigenvalue m and eigenstates of L:-o= 
J* — L? — 3 with eigenvalue / for y+) and —(/ + 1) for o;). The first condition 
gives u = m — 5, and p' = m + 4, the second one, together with the normalization 


3D5)2 —_ 
3P. 
7/2. —__ 3D5/,) =a 
3512 —_ 
é 3Pyj. —_ 
2 
fii WS = 
> Py), —_ pets 
ine structure 
25: 
a 1/2 —_ Lamb ] 10.9 GHz 
m(Ltv aa)" shift 2?) ).—_ 
2 1057 MHz 
ie Triplet 
m./ 1—a2 Hyperfine structure 
1S; )_—_ | 1420 MHz 
Singlet 


Figure 2-2 Low-lying energy levels of hydrogen. 
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condition |a|? + |b|?, determines a and b. We finally get the spinor harmonics 


oP rei? es) ne 
(l—-m+4)2 yrrie 


(l= m4 42 vpn) 
—(1+m +4 4)! ymti2 /= 1-4,1>0 (2-89) 
The phase has been chosen in such a way that 

Phim = OF Pim (2-90) 


Since o* Ff is a pseudoscalar operator, g\,,’ and ¢,,’ have opposite parities (also, 
their angular momenta /! differ by one unit). It is convenient to introduce a 
common notation. Let Pim denote oS,) if j=1+4 and o,) if j=1-4. We 
verify by inspection that 9j,, has parity (— 1)’. 

Since the Dirac equation in the Coulomb potential 


j=lt+ 4 (2-88) 


% Seteaiie| 


Ey = Ga V+ Bm — 21 y = Hw (2-91) 


is invariant under a space reflection, odd and even eigenstates may be constructed, 
namely, 


BUS (X) = EWS ma () 


It is clear that the spinors 
a (o- r) Pim 


have parity (— 1)’. In (2-92), the factors i and 1/r have been introduced for later 
convenience. Noting that H in (2-91) reads 


we do the intermediate calculations 


Bf ln = 9° HOF 0° BF) = A (tp + fo LS 0) Om 


a tr OO 4 1 FU +12)] fio} gin  forj=l+4 
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and, similarly, 
(6° p)(a-F) f(r) Pim = aa pee +1l+(j+ 5) [eve 
P ai r\ wor = Zs _ 


Our lengthy separation of variables results in the pair of radial equations 


Z de eis: 1 ) Hag 


L wer 
(e+m+ 74) Byte) = peal (i+ 5) 
r dr 2 


(2-93) 


To solve this pair of equations, we introduce the notation i =. m? — E? (since E? < m’), the new 
variable p = 2dr, and, for | and j given, the new functions F,(p) and F2(p): 


E 1/2 
G(r) = (1 + =) e~ !?(F, + F2)(p) 


E\!? 
Fin (1 = =) eA {F, — F2)(p) 
m 


By eliminating F2 in favor of F;, we get a second-order differential equation, the solution of which is 


y — ZaE/A ( Zak ) 
= eee F a Oy +f: 
‘~~j+zeme gt 
ZaE 
Fob aes 0 
with y= (44)? — 2707]? =f +4- 5, 


Here F(a, b; p) denotes the degenerate hypergeometric function solution of 
d? d 
— + (b—- p)—-—a|F ;p)= 
Gon ?) ip a| (a,b; p)=0 
For large p this function behaves as [T'(b)/I'(a)]p*~°e’. Demanding that F,; and G,; be normalizable, 


that is, JF dr(F?, + G7) = 1, implies that [Ty — ZaE/A)]~! must vanish. This is the desired quantiza- 
tion condition 


ZaE as 
ney v— ie (nonnegative integer) 
=n—(j+3) 
which leads to (2-87): 
ZaE 
(m? — E2172 0) 


Collecting all the factors, it is then possible to write the expression of the normalized solutions 
F,; and G,;, and therefore of the Wjn- 
We only quote here the form of the ground-state wave functions 
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0 
(2mZa)*! t+y \? eee eon) 
Wn=1,j=1/2,m=1/2 = “an? \2rd +2) (2mZar)’~* e~™ a 6 
{l= , 
i - ” sin 6 e® 
a 
and (2-94) 
0 
1 
(QmZa)?? (14 ” : i(1 — y) 
_ 4 redo bs 2mZ y~-1 ,—mZaer pS ee, 4 —-i9 
Wn=tyet 2m= 12 eee (Gra ep (2mZary’~ 1 e = sin 6 e 
== 
Zo 


Note that y=. 1 — Z*x* ~ 1 — (Z?x?/2). In the nonrelativistic limit y > 1, we recover Schrodinger 
wave functions multiplied by Pauli spinors. On the other hand, these wave functions are singular 
at the origin, but this effect is noticeable only for 


DniBarisewt'* ~ 10 193092 


i.e., in a rather tiny region! 


To compare these results with the experimental levels, several other effects 
have to be taken into account. 


Hyperfine structure As a first approximation, we have neglected the magnetic 
field induced by the magnetic moment of the nucleus (we consider henceforth 
the hydrogen atom Z = 1). The coupling of the proton spin with the total 
electron angular momentum splits the levels into doublets. To get an estimate 
of this effect, let us use a nonrelativistic approximation for s states. The new 
interaction is 


€ 
a= ——__0, 5 
hf ae 
where B=curlA A= —— 1, x V— 
An 


«, denotes the electron spin, and yp, the proton magnetic moment. We veal that 
for a current distribution j(x), the magnetic moment is w= 2J r x j(x) d°x and 


AA = —j; therefore for a moment localized at the origin j = —# * V6? (x). Hence 
(oa! 1 
2 eee v\1= 
bt eens, a.°[V X (Hp X Y)] : 


e 1 
a me °° [apA em (Hp° V)V] S. 
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In s states, we only need the angular average of Hy, ViV; can be replaced by 
1 
3 Adi;, and 


( 


Hip = — a Fe" Mp O'(r) (2-95) 


3m 


Introducing the proton gyromagnetic ratio gp, Hp = —(gpe/2m,\(e,/2), we obtain 
= e2 0 2 
(Ay = Dae 9pFe* Fp|W(0)| 


For the ground state, n = 1 [compare (2-84)]: 


(mea)° 
a 
VON ae are 
and o,°6, takes the values 1 in the triplet state and —3 in the singlet one. 
Finally, we get 


a e 

AEns (triplet-singlet) = — m.a* ee gp = SRO 10" °eV = ee & 10eHz 
n=1,j=1/2 5 Mp 

If we compare this splitting to the fine splitting, we see that the dominant 

effect is a reduction by a factor m,/my ~ Hp/ue. The previous estimate may, of 

course, be extended to higher waves (1 > 1). 


Radiative corrections There are several kinds of such corrections. First, the 
excited states are unstable—they acquire a width—and the atom may undergo 
a spontaneous transition to a lower state. 


In a nonrelativistic dipole approximation, the probability per unit time of radiative transition between 
two states A and y per unit time is 


NG ao ; 
Vinci ae Saeuie <u || DI Ad 


where <j ||D|| > is the reduced matrix element of the dipole operator D = er, as defined by the 
Wigner-Eckart theorem: 


, : 1 : ; 
<j yty|Dg|Ajam,> = ya pr? Jetta Lalisms><u||D]2> 


In particular, for a transition 2P > 1S, we find 


Wis-op = 6.2 x 10®s-1 = 4.1 x 1077 eV 


Second, the charged particles interact with the fluctuations of the quantized 
electromagnetic field. The latter vanishes only on an average. As a result the 
levels are slightly shifted. Although a complete and systematic treatment of these 
effects requires the methods of quantum field theory to be described later, we 
may give, following Welton, a qualitative description of the main effect: the 
Lamb shift. It is based upon the same kind of argument as the discussion of 
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the Darwin term in the last section, but here the fluctuation of the position is 
due to the electromagnetic field, rather than to a relativistic zitterbewegung. 
Hence the new contribution to the hamiltonian has the form 


Ls aes $ <(dr)*> AV. 


where V = —(Za/r) and therefore, according to Poisson’s law, AV = 4nZa6°(r). 
In this simple picture, treating this perturbation to first order, only s waves are 
affected and the nth level is shifted by the amount 


2nZ 
AE yamp(t) = —— <(58)?>|Yn(0)]? 


_ (2mZax)° a > 
= 12 Ae) <(6r) > 


(For higher waves, the shift is reduced by the vanishing of the wave function at 
the origin, and is considerably smaller.) The estimate of <(5r) > relies on a classical 
description of the motion of the electron in the fluctuating field (the nucleus, 
which is heavier, does not move). The electron oscillates according to 


mor = eK 
The Fourier component E., with frequency w of the electric field contributes 


e 
Molo = — —> Eo 
wo 


and assuming that there is no correlation between various modes 
2 (oe) 
e dw 
<(dr)*> =a | sr 3 <Ee> 
m 0 @® 


Anticipating the quantum treatment of the electromagnetic field, we suppose 
that the field is an incoherent superposition of plane waves and remember that 
the vacuum energy of such a field is the sum of zero point energies 


5 | axe? + B= Spe 
2 es ae 


over all wave numbers k and polarizations A. In a big box of size L, k= (27/L)n 
(n,, Ny, and n, integers) and 


Dae 2 oF 3 | ak Mr 
5 fe x +B) =20? | Os 5 
cB) = | do <2) = py | Px? 


—_—- 


2n? 


d°k dw a? 
ico 
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Therefore, 


mm? @ 


<(6r)?> = = \ 2 


The last integral is divergent both for small and large frequencies. The ultra- 
violet divergence when w — co comes from our poor quantum treatment; there 
is actually a cutoff for distances of order h/mc, that is, frequencies w ~ mc? /h. 
At the other end, the infrared divergence for small frequencies should be cured 
by a more accurate treatment of the electromagnetic field in the presence of the 
charges. The large wavelength modes are sensitive to the low-lying electronic 
states. This suggests an infrared cutoff of the order w ~ c/a = mc*a/h. Using these 
crude estimates we expect a value 
(arty ~ 2% n= 


™™ 


and therefore a Lamb shift 


4 mZ*0 1 
AE} amb(" ) oa a 7 In ( S19 


a 


For the level n = 2 of the hydrogen atom the calculated shift 2S,,. — 2P,)2 is 
AE, amb ~ 660 MHz 


in rough agreement with the observed shift of 1057 MHz. The comparison of 
this term with the Darwin term in Eq. (2-82) shows a reduction factor of order 
a In 1/ax. These approximations will be considerably improved in the sequel (see 
Chap. 7). 


Nuclear effects The nucleus has a finite size and its charge distribution is not 
concentrated at a point. For the proton this is represented by a form factor. 
This affects predominantly the s-wave states, since higher /-wave functions vanish 
at the origin. An interesting consequence is the isotopic effect, predicted as early 
as 1932, where different isotopes would have slightly shifted levels. 


For light nuclei, the main contribution comes from the mass difference of the various isotopes: 


) oe 


electrons 


1 
B~ > A/ : 
2 \m, 


since the reduced mass of the electrons is 1/m = 1/m, + 1/m,y. For heavy nuclei, however, the finite 
size of the charge distribution is the leading effect. In a nonrelativistic approximation, we write the 
correction as 


ucl 


bE~e [es W(x)? [ve ze i 
4nr 


where V(r) is the true potential and —(Ze/r) its Coulomb approximation. 
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bE~e |y(0)|? [ex lv a Z| 
4ur 


~~ 


|y»(0)|? [ers r2AV 


e 
= 6 |w(0)|? Ze<r? Dancin 

where we have inserted Ar? = 6, integrated by parts, and used the Poisson law: AV = —p, 
fp@x= —Ze. According to (2-84), the effect is proportional to Z*. This effect might be used in 
practice for isotopic separation. A given isotope is excited to some level by a first laser ray and then 
ionized by a second laser ray. An electrostatic separation may then be achieved. 


A second consequence is that the critical value Z, ~ 137 beyond which the ground state becomes 
unstable is pushed to larger values Z ~ 175. 


Two-body relativistic corrections A correct treatment should also include the 
recoil of the nucleus. This is a difficult problem, which may be tackled by a 
relativistic two-body equation (Chap. 10). An heuristic reasoning may, however, 
shed some light on this point. From the predictions of the Schrodinger and 
Klein-Gordon equations [(2-83) and (2-86)], we see that in both cases they can 
be interpreted by imposing the following condition on the velocity v: 

in ar 

yl = n= positive integer 


Indeed, in the Schrodinger case, 


¥ a? 
eae ie 
2 2n 
whereas for the Klein-Gordon equation vy? = p*/E?, 
Ea Fe m 
ro or ~ (+ a2/n)? 


and the desired result is obtained by replacing n by n— 6; as in (2-86). In a 
two-body problem, let us assume that the relevant velocity is the relative one, 
namely, the velocity of one of the particles measured in the rest frame of the 
other. Then 

2mM 


Eto = (p+ PY’ =m! + M° + = ain 


which is symmetric in the interchange mo M. Using again our empirical rule 


a 


~~ op 


y2 


we obtain 


2mM 


2 eee 2 a 
we EM) TF he FY” 


. 
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wf, aM | 
oF aie ~ On? m+ M 8n*m+M (m + M)? 
4 
a” mM 1 + O(a) 


we mt+M 41 
This expression reproduces fairly well the recoil effects obtained in a more 
refined treatment. 


2-4 HOLE THEORY AND CHARGE CONJUGATION 


2-4-1 Reinterpretation of Negative Energy Solutions 


In spite of the successes of the Dirac equation, we must abandon our ostrich 
policy and face the interpretation of negative energy solutions. As explained 
previously, their presence is intolerable, since they make all positive energy states 
unstable in the final analysis. 

A solution was proposed by Dirac as early as 1930 in terms of a many- 
particle theory. Although this shall not be the final standpoint, as it does not 
apply to scalar particles, for instance, it is instructive to retrace his reasoning. 
It provides an intuitive physical picture useful in practical instances, and permits 
fruitful analogies with different situations such as electrons in a metal. Its major 
assumption is that all the negative energy levels are filled up in the vacuum state. 
According to the Pauli exclusion principle, this prevents any electron from falling 
into these negative energy states, and thereby insures the stability of positive 
energy physical states. In turn, an electron of the negative energy sea may be 
excited to a positive energy state. It then leaves a hole in the sea. This hole 
in the negative energy, negatively charged states appears as a positive energy, 
positively charged particle—the positron. Besides the properties of the positron, 
its charge |e| = —e and its rest mass m,, this theory also predicts new observable 
phenomena: 


1. The annihilation of an electron-positron pair. A (positive energy) electron falls 
into a hole in the negative energy sea with the emission of radiation. From 
energy momentum conservation at least two photons are emitted, unless a 
nucleus is present to absorb energy and momentum. 

2. Conversely, an electron-positron pair may be created from the vacuum by an 
incident photon beam in the presence of a target to balance energy and 
momentum. This is the process mentioned above; a hole is created while the 
excited electron acquires a positive energy. 


Thus the theory predicts the existence of positrons which were in fact observed 
in 1932. Since positrons and electrons may annihilate, we must abandon the 
interpretation of the Dirac equation as a wave equation. Also, the reasons for 
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discarding the Klein-Gordon equation no longer hold. As we shall see, it actually 
describes spinless particles, such as pions. However, the hole interpretation is not 
satisfactory for bosons, since Fermi statistics plays a crucial role in Dirac’s 
argument. 

Even for fermions, the concept of an infinitely charged unobservable sea 
looks rather queer. We have instead to construct a true many-body theory to 
accommodate particles and antiparticles in a consistent way. This will be achieved 
by the “second quantization,” 1.e., the introduction of quantized fields capable of 
creating or annihilating particles. 


2-4-2 Charge Conjugation 


Hole theory implies the existence of electrons and positrons with the same mass 
and opposite charges which obey the same equation. The Dirac equation must 
therefore admit a new symmetry corresponding to the interchange particle 
antiparticle. We thus seek a transformation wy — y/ reversing the charge, i.e., such 
that 


(id — eA — my = 0 


(2-96) 
and (id +eA—m)W =0 


We demand that this transformation be local and that its square amount 
at most to multiplying w by an unobservable phase. To construct ¥/° we conjugate 
and transpose the first equation and get 


(= id, — eA,) — my" =0 


with wy? = y°7y*. In any representation of the y algebra there must exist a matrix 
C which satisfies 


GC = 7, (2-97) 

For instance, in the representation (2-10), C may be taken as 

0 —io? 
= jy?y® = 2-97a) 
C=iy*y ( eens 0 ) ( 
Sea) = C= Ct (2-97b) 
We then identify y° as 

Wancy (2-98) 


with 7, an arbitrary unobservable phase, generally taken as being equal to unity. 
In the present framework charge conjugation is an antilinear transformation. 
This is consistent with the hole interpretation, since when computing a transition 
probability the presence of a particle in a certain state will be represented by 
w and its absence by y*. Let us examine more closely the properties of this 
charge conjugation. We compute y for w describing a spin-down negative energy 
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electron at rest. In the absence of external field 


0 
i 0 —im 
wae | Wancy =ne™ 


Therefore, the charge conjugate of a negative energy spin-down electron is indeed 
equivalent to a positive energy spin-up electron. 

For an arbitrary solution y of energy momentum p polarized along n, we 
know that 


ep +m 1+ ysn 


°> 0 
2m 2 v P 


y= 


where ¢ = +1 denotes the sign of the energy. Since C commutes with ys = 73 
in the Dirac representation, 


Wo = CPT = Cy? (7 * my? ¥ Lt) ys 


2m B 


—eptm\/(1+yst\ ,. 
(ae ne 


W*° is described by the same four-vectors p and n, but the sign of the energy has 
been reversed. Using the notations (2-48), we have 


u(p, n) = n(p, n)v*(p, n) 
v(p, n) = n(p, n)u‘(p, n) 


where the phase n(p,n) may depend on p and n. We recall that the projector 
(1 + ys7)/2 projects onto spin states +4 along n according to the sign of the 
energy. Thus the spin is reversed by charge conjugation. 

We note, furthermore, that under a common transformation on the spinor 
w and the potential A, 


(2-100) 


—- = — sc 
poy =nCy (2-984) 
A, Ai, = —A, 
the Dirac equation (2-96) remains unchanged. 


The transformation law of the four-vector current under charge conjugation is 
— WynW >i. = Vy 
= "Cy, Cy = wr yy" 


We could naively conclude that j, = Wy,W =j,. However, we shall see in the next chapter that yw 
and w have to be considered as anticommuting operators (Fermi-Dirac statistics). Therefore charge 
conjugation will reverse the sign of j, and leave ej - A unchanged. 
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We shall explore the intricacies of the last discrete symmetry, namely time 
reversal, in Chap. 3, when we have a satisfactory formulation of particles and 
antiparticles. 


2-4-3 Zero-Mass Particles 


In Sec. 2-2-1, while constructing the spinor solutions of the free Dirac equation, 
we have discarded the massless case m = 0. However, the neutrinos are massless 
spin 3 particles. In addition, we expect that at very high energy massive particles 
behave as massless. Therefore we reexamine this case. We start from the massless 
Dirac equation 


pw=0 pz = i0, (2-101) 
Multiplying this equation by y°y° = —iy'y?y° yields 
Dy pw = py (2-102) 


since, for instance, (y°y°)y! = iy?y? = 673 = )"'. The chirality operator ys anti- 
commutes with p. For a positive energy solution 


W(x) = e *** Wk) 
Equation (2-101) requires k? = 0, thus k° = E = |k|, and Eq. (2-102) implies 
Yi ky = yy (2-103) 


Therefore the chirality equals the helicity (it is opposite for negative energy 
solutions). Let us label the independent solutions of (2-101) by their chirality: 


= e ™** 4, (k) 

W(x) = =: 
ysus(k) = +u+(k) 
ysvi(k) = +v.(k) 


with k? = 0, k° = |k| > 0 


Ome: 
In the usual representation y; = ( i 


jae 
_1( ash) a ( ay 
100)= op) re Fy ab (h) 
(HOS = 
and a.(k)=o-kas(k)=| ~ ‘ = io2a*(k) 
sin5 
—snz=e 
a_(k) = —9o°ka_(k)= = —ioz,a%(k) 
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where 0 and g are the polar angles of k. Similarly, 


a es i ue k = —4y k 
va) Ca) = (” (k) phe ant | git 
: _ 1 (b-(k) = io2a%(k) = ent 7 
v= CHE) = 7 4b it = —u_(k) 


There exists only two independent solutions, for a given k. 

Experimental observation shows that only negative chirality neutrinos exist. 
Neutrinos have the helicity — 1, antineutrinos the helicity +1. This will be better 
understood in terms of two-component spinors. Indeed the reason for using four- 
component spinors no longer holds for the massless Dirac equation 


ow 1 
iy Lone 
; ot i ike 
where the algebra 
{oi, Xj} = 264; 


may be realized by the three-two-dimensional Pauli matrices. The identification 
a; + o; leads to positive helicity, positive energy particles, whereas a; > — o; gives 
negative helicity. Such spinors, initially introduced by H. Weyl, were rejected 
because they were incompatible with parity conservation (which reverses the sign 
of helicity). This is not a serious objection any more since neutrinos are involved 
in weak interactions which do not conserve parity. 

We have already introduced in (2-20) the corresponding chiral representations 
of « matrices: 


pc 0 oat SP aa 5 Shee 
Bi meeo) eee)? Neer 
1) 
0 


(—p° + p-s)p=0 (2-104) 


For positive chirality, y> = +1, y= ( ) and y: pw = 0 reduces to 


whereas for y° = —1, Wy = (") and 


(p° + p-a)y =0 (2-105) 


In both cases, we have a two-component theory, and the Dirac equation is 
equivalent to the pair of Weyl equations. The so-called charge conjugation C 
(the neutrinos have no charge!) connects the two chiralities and changes the 
sign of the energy. There is no C invariance if nature uses only neutrinos of a 
definite chirality. Actually, since the parity operation P also connects the two 
types of solutions 
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w(t, x) aes y w(t, an x) 


(y° is antidiagonal), the combined operation CP leaves the Weyl equations in- 
variant. In the new representation, the matrix C of (2-97) reads 


c(™ ,) 
102 


Therefore, the CP operation acts according to 
WR(t, x) = nCW*(t, —x) = Finor*(t, —x) (2-106) 


for chirality y> = +1 respectively. 
We observe that the Lorentz invariant normalizations (2-43a, b) of the massive 
solutions have to be modified in the massless case. Then we shall write 


i(k) u!(k) = 2E Sag 
v(k)y°v(k) = 2Ebap 


and leave it to the reader to construct the appropriate plane wave solutions. 


2-5 DIRAC PROPAGATOR 


2-5-1 Free Propagator 


In Chap. 1 we developed the concept of Green functions of a classical scalar 
field. We will extend it here to spin 4 particles. We consider first free propagation. 

Let us try to determine the solution of the Dirac equation at time tz as a 
function of its value at an earlier time t,. This is possible since we deal with a 
first-order equation. We thus look for a kernel K(x, x;) such that 


W(t2, X2) = fen K (ta, X25 ta, X1)y° W(t, X1) (2-107) 


The introduction of y® will be justified soon. Any solution y is a linear super- 
position of plane wave solutions 


W(t x) = 2 |S 2 [a (k)u(k) eg * + BO*(kyv(k) e**] 


(«) 


Owing to the relations (2-43), we may write 


| d>x i(k) e~ &* pW (0, x) = a(k) 


and [ex vk) ef *y° WO, x) = DO *(k) 


a 
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Therefore, 
d°k m - Bi Pe 
W(t2, X2) = |S Ex [ex [ul(k) @ uk) e a) 
+ vk) @ (hk) el 2—*9] yw (t1, x1) 

Interchanging the order of integration, we find the desired kernel 
d*k m 
K (ta, X23 t1,%1) = |S E 
+ v(k) @ H(k) ek ""*"] for tz >t, (2-108) 


Notice that K depends only on (x2 — x;), which is a reflection of the translation 
invariance of the free equation. We may also use the projectors A.(k) of Eqs. 
(2-40) and (2-41) to recast K(x, x;) in a more compact form 


3 [u'(k) ® i(k) e7 ik: (2 — x) 


d®k 


2E(2n)? [(k ae m) e~ %* 2-*) 7: (kK a m) gh maa] 


(2-109) 


K(x2, X1) = O(t2 — t1) | 


~ 


We denote this retarded kernel by K;e. Let us show directly that it is a Green function of the Dirac 
equation. Acting on Kret(x2, x1) with (ig. — m) yields 
a°k 


2E(2n)> [ + m) efk-O2—*1) + ( — m) ei Oa—™) 7 


(i92 — m)Kret(X2, x1) = iy° (tz — t1) pe 


We may change k into —k in the second term of the right-hand side, the coefficient of (y:k + m) 
vanishes, and we obtain 
(if2 — m)Kret(x2, X1) = i5*(x2 — x1) (2-110) 


Kret(X2, X1) may also be expressed in terms of the scalar retarded Green function G,(x2 — x,) 
[Eq. (1-169)] as 


Kyet(X2 — X1) = —i(iG2 + m)Gyey(x2 — x1) 

Equation (2-110) also follows from the identity (1-165) satisfied by G,.,: 
(C2 + m?)Gree(x2 — x1) = 64(x2 — x4) 
From 
~ik+ x 
C= ~ aap |G TRE 

it follows that 
sco ther 
(ko + ie)? — k? — m? 


where the ko integration has to be performed first, along the dashed contour of Fig. 2-3. 


i P 
Kel) = Gn [an en iksx 


The hole theory suggests the introduction of a different Green function, the 
Feynman propagator already discussed in the last section of Chap. 1. It will 
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Figure 2-3 Contour of integration for the definition of Green functions. The broken line corresponds 
to the retarded propagator, the solid line to the Feynman one. 


appear in a natural way in the quantized field theory. Nevertheless, let us sketch 
the ideas that led Feynman and Stueckelberg to its construction. 
A Green function may be considered as describing three successive steps: 


1. Appearance of an electron at (t,, x;) 
2. Propagation of the electron from (t;, x;) to (t2, x2) 
3. Disappearance of the electron at (t2, x2) 


As long as the electron has a positive energy, this process is physically acceptable 
for tz > t,. On the other hand, if we deal with a negative energy electron, we 
would like to interpret its vanishing as the appearance of a positron, and vice 
versa. The second step should be then considered as the propagation of the 
positron from (t2, X2) to (t;, x,), which makes sense only for tz < ¢1. Therefore 
in the hole theory, we would like to construct a Green function which propagates 
the positive energy solutions only for t2 > t,, the negative energy ones (more 
precisely the positrons) only for t; > t2: 


Sela, x1) = | Son 
+ A(t; — t2)b(k — m) ef &2-*?7 
The constants a and b are determined by imposing that 
(id2 — m)Sp(x2, X1) = 54(x2 — x1) (2-111) 


[Notice the change of normalization with respect to (2-110)]. It follows from a 
straightforward calculation that 


[O(t2 — ty)a(k + m) e7 ik + (2—*1) 


d ice 
meee te 


— b(t — m) @~ H— m0] 


(ig. — m)Sr(X2, X1) = 10(t2 — ty) | 
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d°k om 
=110(tg—t1) lao eet) 


x y°[a(Ey® — y-k + m) — B(Ey? + y-k —m)] 
and the condition (2-111) is fulfilled provided a = —b = 1/i. The result is thus 


1( ser —— 
Sr(x2 —X,)= |e 2E(Qn) 5 (A(t. — tr)(K +m) ea ik: (27x) 


—O(t, — t2)(k — m) el 2-*97 (2-112) 


Kyet(X2 — X1) and iS-(x2 — x,) may differ only by a solution of the homogeneous 
Dirac equation. This is indeed what is found by a direct calculation: 


d®k 
2E(2n)° 
A covariant expression is obtained by means of the integral representation 


a(0) = lim | Ci als 


2-0, J|_, 2in @— i 


Kret(X2 aad X1) md iS r(x2 = Dap) = | (kK a m) elk *(%27%X)) 


The limit ¢ +0, will be understood in what follows. The quantities we are dealing 
with are distributions acting on smooth test functions. After insertion of this 
expression into (2-112), we get 


d°k 1 Sees “ dw iwt 
a (a 2E alc : ne exalts 


— (k ~m) =| & ei | 
-0 O— ie 


In the first integral we set p? = E — w, p= k[E = k® = (k? + m?)"/?] and in the 
second one p® = w — E, p= —k: 


d* SS HOs Ey® BIA OA: a 
site) = | pe ote m_ Ey ae 


(2n)* 2E E + p® — ie E— p®°—ie 
For a vanishing positive ¢, we may write 


(E + p® — ie)(E — p® — ie) = —(p3 — p* — m? + ie) 
Finally, 


d*p pt+m 

Sr(x) = e@ Pe * ——______ -113 

P( ) |e p* — m2 re (2 ) 

The ie term gives the prescription for the momentum integration. The inte- 

gration over p®° is performed first, along the solid contour shown on Fig. 2-3, 

and then we integrate over p. If we consider m as complex m, = m — ie, e>04, 
we have 
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p +m pP+m, 1 1 


Therefore, we may write the Fourier transform of S;(x) as 


. (2s 


p?>—-m+ic p—mtie oi 


Notice the relationship between S,(x) and the Feynman propagator of the scalar 
field (1-178): 
Sr(x) = —(id + m)Gr(x) (2-115) 


To summarize, the role of the Feynman propagator is to propagate the positive 
frequencies toward positive times and the negative ones backward in time. 

Let W'*(ty,xo) and w'?(t2,x) be the positive and negative frequency com- 
ponents of a solution, given for t; and f2 respectively, t; < tz and all x. The 
propagator S- allows us to find W(t, x) at intermediate times t: 


wit, x) =1 [er [Sr(t at Se, ae yyw (ts, y) 


= SHt—t,x—y)) W(t, 9) (2-116) 


2-5-2 Propagation in an Arbitrary External Electromagnetic Field 


In practice, we deal with the propagation in the presence of obstacles: diffusion 
processes, external fields, interactions with other particles. Let us treat the propa- 
gation in an external electromagnetic field 


[ida Se ANXs) — m| Sa(x2, X1) = 6*(x2 — x1) (2-117) 


With very few exceptions, we are unable to find a compact expression for Sy. 
Fortunately, it frequently occurs that the term eA is small enough to be treated 
as a perturbation, and S4 may be expressed as an (asymptotic) expansion in 
eA. To derive it, we multiply both sides of (2-117) by Sp(xs, x2) and integrate 
over X2: 


Sr(x3, X1) = | dts Sr(X3, X2) [id — eA(x2) — m]Sa(x2, x1) 


= | atx Sr(x3, xa)[—i de — @A(x2) — m]S4(x2, x1) 


From (2-111), it follows that 
Sp(x3, x2)(—i@e — m) = 6*(x3 — x2) 


Therefore, the integral equation which determines S, reads 


S4(X3, x) = Sr(x3, el a e | ats A(X2)S 4(X2, x1) : (2-118) 
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S'4 (xp, x;) Sp (xp ~ X)) 
o—— > "2 = e—_—_-® + ®—*-® + ee + --- 
x; Xf xX; xX xX xy xf xy X47 


Figure 2-4 Diagrammatic representation of the perturbative expansion (2-119). The solid line between 
x, and x, stands for the propagator S-(x,; —, x;), the cross for e A(x). 


Equation (2-118) is adapted to a perturbative expansion obtained by iteration: 


Sa(xp, Xi) = Spx, Xi) + | ate Sr(xy, x1)eA(x1)S (x1, xi) 


or [ete d*x2 Sp(xp, x1 )eA(X1)Se(X1, X2)eA(X2)SHX2, Xi) + °° 


(2-119) 


This expansion is depicted diagrammatically on Fig. 2-4. 


Introducing the Fourier transforms.of S$, and A(x): 


d*p; d*py _, Ae 
es ae | \ (2n)* at ee tr A Spy Di) 


— ap —itp-x 
A,(x) = |e A,(p) e 


(we use the same notation in configuration and momentum space for the sake of simplicity) a per- 
turbative expansion of S(py,, pi) may also be written. Since S-p(x,, x;) = Sp(x¢ — x;) is translation 
invariant, 

Sr(py, pi) = (2n)*5*(py — pi)Sr(pi) 
where S;(p) has been given in (2-114) and 


Salpy, pi) = Sr(ps)(2n)* 5*(py = /i})) SF | er Sr(ppeA(p1)S Api)(2x)*6*(py = jon = Je), 


ar {er d*p2 Sr(ps)eA(p1)Sr(p2 + pie A(p2)Sr(p;)(2n)*5*(pp — ps1 — p2 — Did) 


tee 


2-5-3 Application to the Coulomb Scattering 


Coulomb scattering will serve as a testing ground of the propagator method. 
The process under study is the scattering of a charged electron of mass m by a 
center with charge — Ze and infinite mass. The latter creates a potential Ap = 
—Ze/4nr, A = 0, where r stands for the vector joining the center to the charge. 
In classical nonrelativistic mechanics, the trajectories are hyperbolas. The 
“scattering angle 6 is related to the impact parameter b (see Fig. 2-5 for notations) 
by 
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1. Geometrical relations 


2. Energy conservation 


(pa is the momentum at point A.) 
3. Angular momentum conservation 


|= p;b = prb = pa(c — a) 


After elimination of p;, p4, c, and a, we get 


Za 


i 2e tan 6/2 ey 


We consider a uniform flux of electrons, of density p and incident velocity v = 
pi/m = py/m. The number of scattered particles in the solid angle dQ = 22d cos 0 
per unit time is equal to the number of incident particles on the ring of area 
2b db, namely, 


dN Za\? dQ 
“ie lea al dg (=) sin* 82 


Therefore, the differential cross section, defined as the ratio of dN/dt dQ to the 


incident flux, is 
do_ 1 (Za) _ 42?%a?m* (2-121) 
dQ 4sin* 6/2 \2e) ~—_——|q|* 


Figure 2-5 Coulomb scattering off a charge located at F. 
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Here q =p, — p; is the momentum transfer and |q| = 2p; sin 6/2. This is the 
classical Rutherford formula. 

We turn to the relativistic quantum case. We shall use the expression derived 
in (2-116), after substituting S4 for Sp and taking the following boundary con- 
ditions: for t; = —0oo, w'*(t,, x) is an incident plane wave of positive energy 
electrons, whereas for t) = +00, W'~ (tz, x) vanishes. Since we do not know Sa, 
we content ourselves with the first two orders of the perturbative expansion 
(2-119). Let w,,(t, x) be the solution of the free Dirac equation which reduces 
to the incident wave when t > t; = —%. According to (2-119), the perturbative 
wave function reads 


W(x) = Winclx) + Waiee(X) (2-122) 
where 
Weaire(X) = ; lim i fey [ate Sr(x — Zee) 5 (2) — V)? Winelt1, y) 
= (2-123) 
and y=(t1, y) 


Since for z° > ty, 


[ey S(z — yyy Winc(t1, y)= Winc(Z) 


we have 
Wai) = | ats Sr(x — z)eA(z)Win-(Z) 


As x° tends to +00, Wai behaves as a pure positive energy solution of the free 
Dirac equation. Indeed, using the expression (2-112), we see that only the first 
term contributes in this limit, and we get 


ae 3 
Wain) -| £4 e [ete aa els ie A(z)] Winc(Z) 


m 1/2 ; 
a 2 (fz) e Ay) (Ds )S 5: (2-124) 
i: 


where S/:= —ie [ate (a) a (pp) A(z) e!?s 7 W;(z) (2-125) 


In the second expression of (2-124), the sum J d>p/(2n)? has been replaced by a 
sum over the final states in a finite space volume V: 1/V ee ie k,,) VCS 
frm 


wave (m/VE)*/* e~'?’*u(p) describes a particle of velocity p/E and polarization 
a in the volume V. Therefore, the transition amplitude between the state 


1/2 
Wiles) = (Fz) eW i= ylA(p,) 
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and an analogous state w(x) is 


: m — { oy i at eH = 
Si = -1ée Teer | a: i! (pr) A(z) e (p, Pi) u)(p;) (2-125a) 


For the Coulomb problem, 4A = (do, 0) and Ap = — Ze/4nr; therefore, 


—ig’r 
ee gy | ars — ap) %p,) (2-126 


It is gratifying to recover the energy conservation. We finally need the Fourier 
transform of the potential 


—iqee Ag 
d°r sd = —~ =p,— DP; 
| ; Iq/? q=p;-—P 


We are using stationary plane waves instead of wave packets; no wonder, then, 
that the amplitude (2-126) cannot be squared. This is remedied if according to 
Fermi’s golden rule we consider a finite time interval and replace the 6 function by 
oo T/2 
2nd(Ey = E;) = | dt elE,-E)t _, | dt elE,— Edt 
io = FHI) 
a 2 sin [ T(E; = E;)/2] 
7 E, — E; 


The square of this expression behaves for large T as T276(Ey — E)). 
The transition probability between states i and f per unit time and per 
incident particle is then 


aP y; iZ o m 4n i 0,,(B) 2 _ FB. d* py 22107 
fo | V (Bae)*"* ial? u(ps)yu (pi)| 2nd(Ey ENV On)? 


The summation runs over all possible final states, the number of which in the 
momentum volume element dp, is V d>p,/(2m)*. Dividing by the incident flux 
(1/V)(\p;|/E;), we obtain the differential cross section 


472 PEE? 
dosi= (oes 5(Ey — E,)p} dps dQ,|a(pp)yu (P|? 
|p:|E,|q| 


We use |p;| = |py| = py and py dp, = Ey dE, to perform the trivial pr integration 
AZ? a? m? 
la|* 


In the nonrelativistic limit, 7y°u is proportional to 5%. If we do not observe 
the final polarization, a sum over a has to be performed, whereas for an un- 


dos: = |a(p,)y°u(p))|* dQ, (2-128) 
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polarized incident state we average over the two equally probable polarizations p: 


— = i! (p you (pi) | 
dQ unpolarized — “ial 
4Z* 07m? | oPtm ,bptm 
<i ngemeineg T gor ae 2-129 
lq/* 2 o( 2m’ Im { 


Once again, the expression (2-40) has been used. We now need identities for 
traces of y matrices. For any product of an odd number of 7 matrices, the traces 
vanish. For an even number, the following identity may be proved by induction: 


tr (did2°* don) = G1 * G2 tr (3 °** dan) — Ay ° 3 OE (d2da*** dan) +°°° 


Here this reduces to 


tr ppv Be = 4(E:Es — pie pp + EiEs) 
tr y°y® = 4 


We also need the kinematical relations 


~ 


Eo ee pit py = E? — p? cos 0 = m? + 2E6? sin? 5 
where f = v/c = |p|/E is the incoming (or outgoing) velocity and 
4 4aqj 49 Ps a PIs 7B 49 
\q\* = 16|p|* sin at 16f7E*p” sin : 


The final expression of the unpolarized cross section (Mott cross section) is 


aR 

ae Set ea = a (: — pein? 5) (2-130) 
dQ unpolarized 4p B = 6/2 2 

When f — 0, this reduces to the Rutherford formula. Also notice that the relativistic 

correction (1 — f? sin? 6/2)(1 — B?)~! affects predominantly the backward 

scattering. 

This result has been derived for incident electrons. Let us discuss briefly 
the scattering of positrons in the same Coulomb field. The attractive Coulomb 
force is now replaced by a repulsive one. In classical nonrelativistic mechanics, 
this leads to the same Rutherford formula (this remarkable result is a peculiar 
feature of the Coulomb field). In our quantum treatment, we know that the theory 
is invariant under charge conjugation. The scattering of an electron off a charge 
— Ze is the same as that of a positron off a charge +Ze. On the other hand, 
to lowest order, the cross section is an even function of Z. Therefore, the Mott 
cross section is also valid for positrons. 

We may verify this by a direct calculation; we shall rather use the hole 
theory interpretation. An outgoing positron of four-momentum p, and polariza- 
tion « is represented by an “incoming” negative energy solution running back- 
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ward in time: 
Wx) = elPr Xp 1p ,) peat 
We thus return to Eq. (2-116), using this time the boundary conditions 
WO (ty, x) = 0 ty = —0Oo 


y(to, x) = e!Ps* y'— (py) x9 = t2 = +00 


Repeating the steps which led us from (2-122) to (2-125a), with due attention paid 
to the signs, we get 


(er [ats (FE) a’ (p,) el? P)*2 A(z)y'— (p,) 
fi pe E; E, i i 
This is in agreement with a direct computation. The incoming positron is described 
by the wave function 
Vilz) = e7 FF Up) = Cem FP T(,) 


(up to a phase). According to Eq. (2-125), the scattering of these charge —e 
positrons will be described by 


Syi = te {ats W5(z) A(zwi(z) 


I 


m? A/a eT = east Pee erat 
=) = d*zy-” 1 4Cp'— 8)" (p;) ell?s— Pd * 2 
ie (a -) | Zv (pp)C * AC (p 


m2 ii ’ 
=aie (a) fat el (pdudo*'(p,) eerere* (2-131) 


It is now clear that this expression leads to the cross section (2-128). 


We let the reader discuss the polarization effects or the corrections due to the recoil of the nucleus. 
We shall simply recall the useful concept of a form factor. Suppose that we study the scattering by 
a finite charge distribution, a finite radius nucleus, instead of a point-like charge. The spherically 
symmetric distribution — Ze p(r) will be normalized according to 


{ ar=1 
As in (2-126) the cross section to lowest order in @ is proportional to |7(q)|?, where V(q) is the 


Fourier transform of the potential V(r). From the Poisson equation AV = —p, we learn that V(q) 
is related to the form factor p(q): 


(ag) = [oe aa: 
by V(q) = (1/q7)p(q). Therefore, the cross section has to be amended to read 


iaial? 2132) 
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2-5-4 Fock-Schwinger Proper Time Method 


As a complement to the previous study of propagators, we shall present the beautiful method intro- 
duced by Fock and Schwinger and use it to give exact expressions of the Dirac propagator in two 
particular configurations of the external electromagnetic field: constant uniform field and plane wave 
field which we have already encountered in the classical theory. 

Suppose that we seek for the Green function G(x, x’) solution of 


H(x, i8,) G(x, x’) = 64(x — x’) (2-133) 


where H is a polynomial in 0,. The idea is to consider H as an hamiltonian that describes the 
proper time evolution of some system. The previous equation is nothing but the definition of the Green 
function of H(x, p) in the x representation 


<x|x’> = 6*(x — x’) 


(2-134) 
[x*p’]=—ig” p= idz 
We introduce the unitary evolution operator U(x, x’; t): 
r) 
ia. Wibe 5.3 Byes leles jp VUACe, x3 5a) (2-135) 
Tt 
with the boundary conditions 
e lim U(x, x’; t) = d4(x — x’) (2-136a) 
lim U(x, x’; 1) =0 (2-136b) 
We have 
U(x, x’; t) = <x| ex’) = <x|U(2)|x’> (2-137) 
0 
G(x, x‘) = —-i | dali a) (2-138) 
Equation (2-135) may be rewritten as 
id, <x|U(t)|x’> = <x|H(x, p)U|x’> = ¢x|U(t)UT (0) A(x, p)U(2)|x’> PE 


or 18,<x(z)|x'(0)> = <x(z)|H(x(2), p(z))|x’(0)> 


with obvious notations. In favourable instances, we are able to solve for x(t) and p(t) showing the 
relation between quantum and classical treatments. We then express H(x(t), p(t)) as a function of the 
suitably ordered operators x(t) and x’(0): 


<x(t)|H (x(t), p(z))|x'(0)> = F(x, x’ 4) <x(2)|x"(0)> 


Then the equation for U(x, x’; t) becomes an ordinary linear differential equation which may be 
integrated as 


Uixsxest)i— exp |- | he! Jala, see Jets, x’) 


where C(x, x’) has still to be determined in order that U(x, x’; t) satisfies the correct relations between 
position and momentum; for instance, 


[idz, — €A,(x)] <x(z)|x'(0)> = <x(t)|22,(t)|x(0)> 


= le 2-140 
[tee eA,~)] <x(t)|x’(0)> = <x(z){7,(0)| x(0)> Ty = Py — CAy (2-140) 
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In the case of a Dirac particle interacting with an external electromagnetic field, we have to 
solve 
[i9 — eA(x) — m]Sa(x, x’) = 54(x — x’) 
Ga(x, x’) defined by 
Sa(x, x') = [i¢ — eA(x) + m]Ga(x, x’) (2-141) 


satisfies 
HG,= co — eA)? — m? — 5 cw Nats x’) = 54(x — x’) (2-142) 


In the Heisenberg representation, the operators x(t) = U'(t)xU(t) and x(t) = U'(t)xU(t) satisfy the 
Ehrenfest relations 


dx,{t) 
x et, so) —— 20 
(2-143) 
d 
= = i[H, n,] = —2eF,,n° — ied? F,,— 5% Fo” 
since [7,, ty] = —ieF yy. 
Consider first a constant field. The previous equations reduce to 
axy dn 
—=-2 —- = —2eF,,n° 2-144 
dt ae dt si ( 


These equations are readily integrated, using matrix notations 


n(t) = e~ 2¢F*2(0) 


—2eFr 1 
x(t) — x(0) = () (0) 


We then compute z(t) as a function of x(t), x(0): 
n(t) = —4eF e~ °F*[sinh (eFt)]~ ' [x(t) — x(0)] 
and using the antisymmetry of F: 
x(t) = [x(t) — x(0)] KLx(x) — x(0)] 
where K = te’ F?[sinh(eFt)]~? 


Rearranging this expression involves the commutator [x(z), x(0)] : 


7 2eFt ai | 
[x,(t), x (0)] = (—),, 


We finally get H as 
: P 
H = x(t)Kx(2) — 2x(2)Kx(0) + x(0)Kx(0) — 5 tr [eF coth (eF1)] — ew" —m? 2-145) 
and the differential equation satisfied by U(x, x’; t) is 


id, U(x, x’; 1) = 1-5 Oyy FY — m? + (x — x') K(x — x) — tr[eF cotanh cero} huts ee 
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The latter may be integrated to read 


U(x, x’; t) = C(x, x’)t~? exp {—4trIn [(eFr)~* sinh (eF 2)]} 
x exp F (x — x')eF coth (eFt)(x — x’) + > Oyy Ft + in? 


The function C(x, x’) is determined by Eqs. (2-140) which lead to 


ebay 


E — eA,(x) — (x — x Joes sj 0 


| ~i08 — eA,(x') + v (x - xy [ets x= 10 
The solution has the form 
Coex) —IC& vexpr—ie | a - [A(é) + 4F(E — x’)] 


Since A,(&) + 4F,,(€ — x’)” has a vanishing curl, the integral is independent of the path of integration. 
Taking a straight line from x’ to x, the second term does not contribute, owing to the antisymmetry 
of F,,,, and we may write 


‘Che x’) = C exp —ie | a + A(& 


The constant C is finally determined by (2-136a) as 
I 
(4x? 
In summary, the propagator in a constant field reads 


0 


Sa(x, x’) = [i9. — eA(x) + m](—i) ' dt U(x, x’; Tt) (2-141a) 


—=co 


with 


_j x 1 
Wks so a= aero? jn" | dt" A,(é) — 5tt In [(eFt)~* sinh (eFt)] 


j . 
a 4° — x')eF cotanh (eFt)(x — x’) + > OyyF st + i(m? — ior} (2-146) 


Notice the presence of the —ie term [in order to fulfill (2-136b)] and of the phase factor 
exp —ie J d¢- A. Its role is to make U gauge covariant; when A,(x) -> A,(x) + 0,A(x), 


UU it) he EMS x") 2) eM) 


We now turn to the plane wave case. The calculation is quite analogous to the previous one, 
and we merely sketch the successive steps. We consider a linearly polarized plane wave and use the 
same notations as in Sec. 2-2-3: A, = ¢, f(€) with € = n-x, n? =0, Fyy = buy f'(G) where $,, = Wey = 
n,é,. Notice that 0°F,,, = 0. Therefore Eqs. (2-143) take the form 


2-147 
dn, (2-147) 


Ge = ~ deb uar?h (8) — 5 nub po” f"(2) 
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Remarking that (d/dt)(x-n) = 0, dé/dt = —2n-nand [£,d&/dt] = 0, we first solve for ¢: 
E(t) = n> x(t) = €(0) — 2n-nt 
Then we integrate the equation for ¢’*x, and obtain 
on, = en*f(t) + C” 
where C” is a constant operator which commutes with z-n. Inserting this expression into (2-147) 
and integrating it, we get 
1 dx,(t) 
2dr 


= ei) x [2ec,f10) fe Os Pull 2(6) to = Mab nf" @| + Dy 


Here D, 1s a new constant operator commuting with z-n. We then calculate x(t) — x(0) and eliminate 
Die 
xyle) ~ x,(0) a8 


42 
ia [é(x) — €(0)]? cP 


T(t) ae 


d¢ [2<cos (+ en, f+ 5 5 uP wof | 


a. {reCof [é(x)] + e?n, f?[E(x)] + ~ 5 db oo ?"f" tan} 
This allows us to express the constant C,, as 


en, i 
7 AO ly of(0) 


Cu= ~ 5 bapL ee) - x40] - 
After computation of the various commutators 
[é(z), x,(0)] = 0 
[¢(0), x,(t)] = 2in,t 
[xu(c), x*(0)] = —8r 
we may write the hamiltonian H as 
H= a [x2(x) — 2x(t)+ x0) + x?(0)J— = — (e? <6f?> + m?) — ; po” eel (2-148) 


<df?> denotes the quantity 


eo def 7(0) -|{- dé f(¢) [ 
6f? eeene RL 
“oe | &(c) — E(0) ao) §(t) — ¢(0) 


The evolution operator has the form 


oath HOA) 


, Sey} 
Ure 7) = Cys) exp i E i + t(e*<df?> + m?) + e > a 


where the function C(x, x’) is again determined by the relations (2-140). We find 


Bye ea ("” aslo 
Cope = CO jiexp (-i [. dy, \4"0)- een ae -so-n}}) 


where the integral is path independent. For a straight line, the only remaining term in the phase is 
exp —ie J% dy, A“(y), and we find, finally, 


i (x — x’? 
U(x, x5 tT) = (4x nyt aaron! | 


-e |" as, aro} ao | (2-149) 


oa AS)) es 
= | eer?» +m + ; 6,0 = a 
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This result recalls the classical result of Sec. 1-1-3. For a periodic function f(€), the term proportional 
to d,,0°" is smeared out if we average over a few periods. The net effect is a mass shift 


mag = m* + e?<df7)> = m? + e? f? 
Such a nonlinear effect is hard to detect; very high intensity beams are required, since for a mono- 
chromatic plane wave of frequency w/2z = c/A and of energy density 6 = E* = f°w* = phw, we have 


Am? ef? 


2 2 2 


== 4noa? PS — 2022p 
m m 6) 


Here %, is the Compton wavelength of the electron and p is the number of photons per unit 
volume in the incident beam. At present the most powerful laser beams do not enable us to reach a 
sizable value for this ratio. 


NOTES 


The material of this chapter is very standard and we could hardly escape rephrasing 
what is found in many excellent textbooks. See, for instance, J. D. Bjorken and 
S. D. Drell, “Relativistic Quantum Mechanics,” McGraw-Hill, New York, 1964, 
and also A. Messiah, “Quantum Mechanics,” vol. 2, North-Holland, Amsterdam, 
1962. Hole theory is beautifully described by P. A. M. Dirac in his Solvay report 
of 1934, reprinted in “Quantum Electrodynamics,” edited by J. Schwinger, Dover, 
New York, 1958, which contains many of the fundamental papers on relativistic 
quantum field theory. For the propagator approach see R. P. Feynman, Phys. 
Rev., vol. 76, pp. 749-769, 1949, and for the proper time method, J. Schwinger, 
Phys. Rev., vol. 82, p. 664, 1951. The original solution for a Dirac electron in a 
plane wave is due to D. M. Volkow, Z. Physik, vol. 94, p. 25, 1935. For a review 
on the Klein-Gordon equation see H. Feschbach and F. Villars, Rev. Mod. Phys., 
vol. 30, p. 24, 1958. The nonrelativistic limit of the Dirac equation is studied 
in L. L. Foldy and S. A. Wouthuysen, Phys. Rev., vol. 78, p. 29, 1950. An 
overall view on the bound-state spectrum is given in H. A. Bethe and E. E. 
Salpeter, “Quantum Mechanics of One and Two Electron Atoms,” Springer- 
Verlag, Berlin, 1957. See also M. E. Rose, “Relativistic Electron Theory,” John 
Wiley, New York, 1961. The interesting semiclassical interpretation of the Lamb 
shift by T. A. Welton is in Phys. Rev., vol. 74, p. 1157, 1948. For the two- 
component neutrino theory see R. P. Feynman and M. Gell-Mann, Phys. Rev., 
vol., 109, p. 193, 1958: 

An account of relativistic quantum kinematics is given by P. Moussa and 
R. Stora in “Analysis of Scattering and Decay,” edited by M. Nikolic, Gordon 
and Breach, New York, 1968. We refer to this work for complementary details 
on the representations of the Poincaré group and their applications. 


CHAPTER 


THREE 
QUANTIZATION—FREE FIELDS 


Quantization is presented in the canonical way and applied to free fields. Starting 
from a mechanical analog we discuss in turn the neutral and charged scalar field. 
The Gupta-Bleuler indefinite metric is used in the electromagnetic case with 
emphasis on gauge invariance and the problems arising from the vanishing 
photon mass. The vacuum fluctuations are beautifully evidenced by the Casimir 
effect. For Dirac fields, the stability of the vacuum and the exclusion principle 
lead to quantization according to anticommutation rules. This implies the con- 
nection between spin and statistics and the PCT theorem. 


3-1 CANONICAL QUANTIZATION 


In the nonrelativistic case we obtain Schrédinger’s equation by replacing classical 
observables by operators and Poisson brackets by commutators. As usual, con- 
jugate momenta are derivatives of the Lagrange function with respect to velocities. 
We have seen in Chap. | that the classical procedure can be extended to infinite 
systems if discrete indices are replaced by continuous ones and the Kronecker 
symbol by a Dirac 6 function. We shall therefore boldly generalize quantization 
to this case without worrying too much at first. The only point which we should 
be cautious about is to preserve Lorentz invariance. In the specific case of electro- 
dynamics the lagrangian is known, at least classically. The precise role of the 
lagrangian after quantization will have to be clarified. This will be done in Chap. 9, 
where we shall investigate an alternative and fruitful interpretation of quantization 
using the methods of path integrals. 
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It is clear that the correct interpretation of a quantum theory depends on 
its dynamics (which can lead, for instance, to the appearance of bound states, 
etc.). Unfortunately, in most cases we cannot solve the equations of motion and 
we have to treat interactions using approximations. This is a major difficulty 
which sometimes prevents us from reaching a logical and coherent presentation. 
Part of this difficulty can be overcome by renormalization which hides some of 
the interactions by reexpressing the observable quantities in terms of measured 
masses and coupling constants. 

A more consistent but difficult axiomatic approach was pioneered by Wight- 
man. One tries to develop a quantum field theory from a few well-defined axioms 
obtained from an idealization of physical requirements. To uncover the content 
of the theory requires sophisticated mathematical developments. Two main lines 
of attack have been pursued. According to one of them, one studies Green func- 
tions in detail: analyticity in momentum space, algebraic properties, discon- 
tinuities. At an ultimate stage one may reach a scattering theory after proper 
identification of the asymptotic states. A more ambitious program involves the 
explicit construction of the fundamental operators, such as the hamiltonian, in 
order to study their spectral properties. The latter should provide the particle 
interpretation, bound states, scattering states, etc. 

Our goal will be more modest and follows partly the historical path. It 
combines approximations, physical intuition, and mathematical deduction. One 
may hope that these various methods will at some point merge together and that 
some rigorous basis will be given to the above treatment. 

At first sight a field theory does not seem to have an interpretation in terms 
of microobjects identified as particles. The long story of the wave-particle duality 
in the theory of light is a testimony of the confusion which may arise and still 
remains in certain aspects up to this day. One of the triumphs of the quantum 
theory will be to give a better understanding of this phenomenon. Naive pictures 
derived from classical experiences have limitations, however, arising in particular 
from the concept of indistinguishable particles. The structure of the Hilbert space 
of states constructed for free fields—the Fock space—will reflect this aspect. 

We shall only deal here with local theories. The meaning of locality will 
become clearer as we go along. It assumes an idealization of space-time measure- 
ments in arbitrarily small regions. As required by Lorentz invariance and a weak 
form of causality, measurements separated by a space-like interval cannot influence 
one another. In other words, (1) local observables exist and (2) local observables 
relative to space-like separated regions commute. Experimental verifications of 
this postulate are only indirect since we do not possess any apparatus comparable 
to nuclear or subnuclear sizes. In this case, measurement necessarily involves 
objects of the same nature as those under study. Nevertheless, it is a fair statement 
to say that no violation of this principle has been observed down to distances 


he|,/'s where s is the squared center-of-mass energy of particle collisions in present- 


day accelerators (/s < 60 GeV at the CERN ISR). These lengths (~ 107 +> cm) 
are minute fractions of atomic sizes (10~ ® cm). 
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Among the striking consequences emerging when combining microcausality 
_ with relativistic invariance, let us quote the spin statistics relation (half-integer 
spin particles are fermions, integer spin ones are bosons) and the existence of a 
TCP invariance. The latter involves a product of time reversal (T), parity (P), 
and charge conjugation (C). This implies the existence of antiparticles with the 
same kinematic invariants as their particle counterparts and opposite additive 
quantum numbers (electric, baryonic, leptonic charges, etc.). 


3-1-1 General Formulation 


Let ~,(x) be the fields whose dynamics we propose to study. The index « stands 
for internal characteristics (charges, etc.) or kinematic ones (such as Lorentz 
indices). For the moment we assume these fields to be free of constraints (which 
would reduce the number of degrees of freedom). We provisionally set aside 
half-integer spin fields, the correct treatment of which requires special considera- 
tions (Sec. 3-3). 

From the classical Lagrange function at a fixed time 


L()= [ex L(y, 0g) (3-1) 


we derive the conjugate fields 
OL(t) OL 
d[Oo@at,x)] OL0oge(t, x)] 


To construct the hamiltonian operator H we first replace the c-number fields by 
operators satisfying canonical equal-time commutation rules: 


[a(t, x), melt, y)] = 15p5°(x — y) (3-3) 


with the commutators [¢, g] and [z, x] vanishing. After inverting (3-2) to give 
Oog in terms of z and g we obtain H as 


T(t, X) = (3-2) 


H= | d>x b Malt, X)OoPalt, x) — L(g, co) | (3-4) 


This procedure suffers from the usual drawbacks arising from the operator 
ordering. Moreover, the multiplication of operator fields at the same point will 
lead to new difficulties, as we shall soon realize. The two aspects are related. 

It must be stressed that when writing (3-3) we have not yet specified in which 
Hilbert space these operators act. This question has a simple answer in the case 
of free fields, as we shall see, and hence is bypassed when studying small per- 
turbations around this situation. It is, however, entirely nontrivial in the general 
case, where its answer requires some knowledge of the dynamics. This is why the 
latter has some bearing on the very construction of the theory. ; 

To be specific, let us assume that we deal with only one real field g in the 
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classical picture—hence an hermitian g in the quantum one —with a lagrangian 

L£ =4(d9) — Vig) (3-5) 

where V is a smooth function (a polynomial, say). The classical equations of 
motion read 

CF, OF = ii dV() = 


Ue ae 3-6 
“0(0,9) op ° dp — 


If V reduces to a quadratic term V(g) = (m?/2)¢7, 
L(~) = 36) — 4m’ g? (3-7) 
from which follows the Klein-Gordon equation 


(0 +m’)p=0 (3-8) 


to be interpreted here as a classical field equation and not as a relativistic 
generalization of Schrdédinger’s equation. For an arbitrary V(@), that is, for an 


arbitrary self-interaction without derivatives, the conjugate momentum 7z 1s given 
by ~ 


Of 
i 
(Cog) 


so that the hamiltonian, expressed in terms of @ and 7, is 


dop (3-9) 


ae | #Px Sn? + (Vo)?] + VO) (3-10) 


and in the simple case of a quadratic V(@) given by (3-7) 
H= [ex 3[n? + (Vo)? + m7] (3-11) 


We have here no problem of operator ordering. Hence the only source of trouble 
may come from multiplying operators at the same point. Let us stick for the 
time being to the case (3-11). We recognize a simple structure of coupled har- 
monic oscillators. In order to uncover its physical interpretation, let us pretend 
that space has only one dimension and that instead of assuming continuous 
values the coordinate x can take only discrete ones which are integral multiples 
of an elementary length taken as unity. In this case (3-11) would be replaced by 


+0 
H = Y [ah + (On — On-1)? + m2] 


3-12 
[Ons Pn’] = [ttn Mn] = 0 [ Pn» Tn’ | == inn’ ae 


A physical model which could be described by (3-12) would be the vibrations 
of a one-dimensional “crystal,” with @, standing for the displacement of the nth 
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atom and z, the conjugate variable. Each individual oscillator with restoring 
force provided by the m?@; term would be coupled to its nearest neighbors 
~ through the (Qn — @n+1)* contributions to the potential energy. To study such a 
model it 1s natural to search for the proper modes. Using the discrete translational 
invariance of H this leads us to introduce Fourier transforms in the form 


+x +r 
m= | aa.) m= | Baa) 


Be Pe: Se oT 
2 Z : (3-13) 
@'(k) = @(—k) #*(k) = 2(—k) 
The commutation rules assume the form 
p(k), o(k’)| = [z(k), z(k’)| = 0 
[0.6K] = LRU, HC] ae 


eatk), z(—k)) =i > eo *** 


The last sum is also 22 » O(k — k’ + 2nn) which means that if @ and 7 are 
extended as periodic functions of k this reduces to 276(k — k’). By virtue of (3-13) 
the hermitian hamiltonian is now expressed as 


5 a x ff "(k)it(k) + @(k)[m? + 2(1 — cos k)] (k)} (3-15) 


To identify this with a sum of individual uncoupled oscillators we set 
= @_,=./m* + 2(1 — cos k) 


[o,@(k) + ix(k)] 


1 
./4nw, (3-16) 


1 2 ai 
at = —— [o.6'(k) — iz"(4)] 
4nQ, 
[ ay, ay] = 0(k — k’) 
The operators aj and a, create or destroy the mode k with energy w,. The 
hamiltonian H reads 


+n +8 
ft Z | dk (ala, + a,af) = | dk Hy (3-17) 


while the fields are expressed as 


(a, ete aes aid 


ie dk 
Pn = lhe 
=: 
-i|” dk i erie) (3-18) 


= o ikn 
ase =O oo ieae } 
ot be J 4m, ah 


Tn 
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The denomination of a, and af as destruction and creation operators is justified 
by the fact that if |E> is an eigenstate of H with energy E we have 


Ha, |E> = Ea,|E> sP [H, ay | |E> = (E = (Wy) Ax |E> 
Haj |E> = Ea |E> + [H, af] |E> = (E + @,)al |E> 


The mode k of energy «, is interpreted in this mechanical model as a coherent 
quantized vibration of the lattice atoms, or phonon. Here we understand clearly 
the relation between particles (phonons) and fields (@, is the displacement of the 
nth atom). Our physical intuition might lead us first to consider states of the 
crystal characterized by a wave function obtained by diagonalizing the field 


ly>=|{y}> aly? =yaly? 


We shall, rather, choose to generate the states starting from the ground state 
|(0> and its excitations in terms of phonons. We encounter at once a difficulty 
since the ground-state energy, the lowest eigenvalue of the hamiltonian, is in fact 
infinite. Each mode k contributes an amount $a, to the zero point energy. This 
is in agreement with the uncertainty relations since each oscillator has a minimum 
momentum spread due to its potential energy. Since we have a continuum of 
modes, each slice (k, k + dk) leads to an infinite energy—a fact to be traced to the 
infinite size of the system. Indeed, since 


(3-19) 


a,|0> = 0 <0| aj = 0 (3-20) 
for any k, we have 
1 +9 1 +n 


and the last integral is meaningless since [a,, aj] = 5(0)!. If, however, the crystal 
is of finite size (-N<n<N) let us take periodic boundary conditions by 
identifying the sites n and n + p(2N + 1), realizing a circular arrangement. The 
wave vector k would then be restricted to values k = [22/(2N + 1)]q with q an 
integer and —-N <q <N, and the zero point energy (N/2N))\ Nw, would be 
finite. Clearly (1/2N) }'"X @, has a finite limit as N tends to infinity, showing that 
the above energy is indeed proportional to the size of the system. 

By taking a discrete instead of a continuous model, we have introduced a 
Brillouin zone —z < k < xin momentum space, equivalent to an ultraviolet cutoff 
in the original model. The reader will recall that the origin of the expression 
“ultraviolet catastrophe” stems from the divergent contribution to the blackbody 
thermal radiation of high-frequency modes of the electromagnetic field. A space 
cutoff allows, further, an unambiguous definition of the hamiltonian operator. 
This is referred to by saying that we put the system in a box. We can, however, 
use the following device. Zero point energy is unobservable unless we destroy the 
crystal. In field theory, the ground state will be interpreted as the vacuum and 
it will be even harder to destroy! Energy exchanges with the crystal are insensitive 
to the choice of an origin. We declare by fiat that the ground state has zero 
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energy and we redefine the hamiltonian as 


1 +a 
Hie 5 | dk w,[ (ala, + a,al) — <O| (ala, + a,af)|0>] 


‘a (3-21) 
= | dk w, aj a, 
aes 
We shall, of course, have to make sure in relativistic theories that this procedure 
preserves Lorentz covariance. 

In this new expression the creation and annihilation operators appear in 
“normal order,” the latter to the right of the former. This is also called Wick’s 
ordering and is denoted by a double-dot symbol: 


: Fafa, + a,at): = ala, 


Note that under the normal order sign operators commute and that the definition 
uses as reference the free-field ground state. 


If we restore the lattice spacing a, for values of |k| much smaller than the edge of the Brillouin 
zone 7/a, the frequency w, can be approximated by 


2 — 
om fm? +— (1 —cos ak) > ./m? + k? (3-22) 
a 


which is just the form of the dispersion law of relativistic mechanics if k is interpreted as the phonon 
momentum. In solid-state physics, since it is unaffected by the addition of multiples of 27z/a, k is only 
a quasimomentum. 


In the framework of the harmonic theory with its dynamics dictated by (3-12), 
phonons are well-defined entities and conserved in the sense that the number 
operator 


N= [e dk ata, (3-23) 


commutes with the hamiltonian. 

The superposition of several vibration modes leads to a state where the 
phonons have only a limited individual status, while the atoms on the various 
sites can perfectly well be distinguished (it is, for instance, meaningful to introduce 
an impurity on one of the sites). Indeed, the phonons behave as a boson system. 
This manifests itself when we add interactions. We can, for instance, consider an 
extra anharmonic term in the energy or couple the crystal to a thermostat. 

The Bose symmetry appears when we generate the states by acting with 
creation operators on the normalized vacuum |0>. Unfortunately, the state aj [o> 
is not normalizable since <0] a,af |0> = <O| [a, af] |0> = 00 due to the infinite size, 
as before. We can, however, build a wave packet by linear superposition : 


a}|0> = [va f(kat o> 
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where f(k) can be chosen concentrated around a peak value k but such that 


<O| aya} |0> = | "dk, | dky f*(kx)f(k2)<9| ay ,a%, [07 


+7 
-| dk | f (k)|? < 00 
— 
In other words, f(k) has to be square integrable. The fundamental operators a, 
and aj have to be smeared with test functions in order to lead to meaningful 
expressions; they are called operator-valued distributions. The idealization of a 
wave of definite momentum k is to be thought of as a simplification when the 
spread Ak (or Aw,) is negligible as compared to the momenta (or energies) involved 
in the problem at hand. In a crystal, for instance, Ak is even limited from below 
by (Na)~1, an effect which can sometimes be observed. Even in a translation- 
invariant world, the geometry of the experimental apparatus always implies a 
finite spread. 

The state a}|0> is to be interpreted as a one-particle (phonon) state. To 
construct the full Hilbert space of states generated by vectors such as 


~ |r> = af, af, ---af |0> (3-24) 


we use the standard mathematical procedure of linear superposition and Cauchy’s 
completion. The resulting space is called the Fock space. Consider the action of 
N and H on the state |r>. Let us assume the wave functions f, to be strongly 
peaked around the values k,. Then 


Nip (5 0) \r> (3-25) 


Hence we have an r phonon state with an almost well-defined energy. However, 
let us look more closely at its wave function. Using the commutativity of the 
creation operators we may write 


ln> = | “aks | * dle, fulles)-“flke)ab, a, [0D 


— — 


2 an (3-26) 
_ { ley [ dk, = Y, fla)" filkes)al,--- af |0> 


where in the last integrand the sum runs over all permutations « of the set of 
integers (1,...,7). Up to a normalization factor, shall we say that Fys(ki,...,k,) = 
filk1)+>-f(k,) or that 


1 
Fs(k,...,k,) = ay > Silke.) °* *f(ka) (3-27) 


is the wave function of the state? This question would make sense if there 
existed an observable with different mean values for one or the other wave function. 
This is, however, not the case. Indeed, even if we seem to introduce at first the 
nonsymmetric kernel Fyg in the representation of the state r> the commutation 
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of creation operators symmetrizes it automatically. Phonons are identical particles 
and obey Bose statistics as a consequence of the basic commutation rules (3-12). 


To obtain a normalized r-particle state assume, for instance, that the one-particle wave functions 
J,(k) are orthonormalized : 


i " dk £000 fl) = 5, (3-28) 


If <r|r> = 1 then 


In=al aky--( "dl fils)" kal, al, 0) 


Hence 
+x Seog, 
1=(r|r)= ap | dk, =| dk, f*(k1)°*- f.lk;)<O| ay, -*- af. |0> (3-29) 
Moving each operator a, to the right and using aaj. = af.a, + 5(k — k’) together with (3-28) leads 
at once to 
[AP =1 (3-30) 


showing that 4 can be taken equal to a phase factor. If, however, r, of the one-particle wave functions 
are equal to fi, r2 to f2, and so on, with the f functions still orthonormal, it is readily shown that 
the correctly normalized state |r> reads 


1 +z +x 
Ir> = [iF i, Tay .2-}> = Shine | dk, =| dk, fi(ki)>* filky,) folkr,+ 1) 
rylrgl--- J-« =r 


“+ falkry+r) aba |0> (3-31) 


More generally, if F(k,,...,k,) is a symmetric function then the normalized r-particle state is 


+z ot 
\r>= af dk, | dk, F(k1,...,kr)a',,--- al. |O> 


—f 


(3-32) 
+x +x 
lapt=rt | at,--| Gk, | E(t, <0, Ke)" 


Pere 


It is an interesting exercise left to the reader to compute the wave function of a state with a fixed 
number of phonons, and in particular the ground state, in the basis which diagonalizes the field. 


Canonical quantization has been performed ata fixed time, say t = 0. However, 
the theory is invariant under time translation. We could therefore have chosen 
another reference time t and used the operators @,(t), 7,(t) such that 


Pn(t) = iLH, Pnrlt)] = tlt) — @n(0) = Pn 
tn(t) = iL H, 7,(t)] T,(0) = Tr 


(3-33) 


with 
Be iS . ° 
hE t ee —iw,t+ ikn a at gore) 


| dk 
@r(t) = e'™ ope ./4no, (a, e 
eect (3-34) 


Tn(t) = Pn(t) 
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Clearly a, e~'', af e satisfy canonical commutation rules, and H and N are 
time independent. The relation between the observables at times ¢ and 0 is a 
unitarily implemented canonical transformation. The matrix elements representing 
measurements are independent of the choice of the Heisenberg or Schrodinger 
picture 


aes AOje"*|a>= (<a e'4t) A(O)(e"'#' |a>)  (Schrédinger) 
=<al(e4 AOien} a> (Heisenberg) 


The structure of Eq. (3-34) clarifies the presence of positive and negative frequencies 
in the field. They correspond to the creation or destruction of phonon modes. 
The latter have honest positive energies (hw). Hence ¢ is not to be understood 
as a wave function. It is an operator in Fock space even though it is written 
in terms of the solutions to a wave equation. The superposition is weighted with 
operator-valued amplitudes. This procedure is referred to as second quantization. 
The mechanical example of the crystal has prepared the way for the quantum 
treatment of the relativistic scalar field. The atomic interpretation of the vibrations 
will disappear but the analogy between phonons and particles remains. We shall 
have even fewer scruples in setting the vacuum energy equal to zero. 


3-1-2 Scalar Field 


For the quantized free scalar field with an hamiltonian given by Eq. (3-11) we 
have essentially to transcribe the previous formulas in three-dimensional con- 
tinuous space. The three-momenta (an interpretation to be confirmed below) 


are denoted k, while k° also stands for a = w-, = \/k* + m? > 0. Throughout 
this book, the phase space measure for bosons will be noted: 


d*k d*k 


ae = On2en, Ont 


2n6(k? — m?)0(k°) (3-35) 


The last expression exhibits clearly the Lorentz invariance of this measure. This 
may be checked directly on d*k/q,: it is invariant under rotations; a boost of 
velocity tanh @ leaves invariant the transverse components k; as well as w? — k?. 
Therefore, 


dk, = (cost 6 + sinh 6 a dk, 
Ox 


Ox = (cost @ + sinh 6 *)on 
Ox 
proving that dk;,/a@, = dkz,/ay. 
In terms of the creation and annihilation operators satisfying 
[a(k), at(k’)] = (22)720,67(k — k’) 


Latk), a(k’y] = [at(h), at(k’y] = 0 (3-36) 
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the conjugate fields g(x), 2(x) at time r= 0, fulfilling the basic commutation 
rules (3.3) are 


(x) = {at [a(k) ex + at(k) e~® x] 
(3-37) 
n(x) = —i | a x [L.a(k) ef * — al(k) e~ eX] 
The vacuum ground state is defined through 
a(k)|0> = 0 <O/ Opt 


while the hamiltonian takes the form 
l e is 
a [ak Wy a*(k)a(k) + a(k)at(k): = {a oa! (k)a(k) (3-38) 


According to Chap. | it is natural to expect that the linear momentum operator 
P is given by 


Pi= [es ©°(x) = {ak ki Fl = | a kiat(k)a(k) (3-39) 


Unlike the case of energy, no normal ordering is required. The modes k and 
—k compensate each other, so that the vacuum is translationally invariant, 1.e., 
an eigenstate of P with zero eigenvalue. The operators H = P® and P/ commute, 
and 

[P*, a(k)] = k*a'(k) (3-40) 


showing that a‘(k) acting on a state adds a four-momentum k*. 
Up to normal ordering the energy momentum tensor density is identical with its classical counterpart 
O"(x) = d*—pd"— — g*[4(Gy)? — 4m? p*] (3-41) 
with 0° (x) replaced by 2(x). 
At time t = x° the field g reads 
lt, x) = ei 9(0, x) eH = {a [a(kye"** + ale *] (3-42) 


This expression automatically fulfills the Klein-Gordon equation. If we use the 
ad 
symbol uév for 


udv = u(dv) — (6u)v 


we can write a(k) in terms of (0, x) and 2(0, x) as 


a(k) = [ex e~ kx Tw, p(0, x) + in(0, x)] = i [es [e** Ao olt, x)|:=0 (3-43) 
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and the last expression is in fact time independent. Indeed, 
60 [ex el * 80 (x) -|as feik-* 93 (x) + [(m? — A) e***] Q(x)} 


using the fact that ki = m?+k?. In all applications we have to consider a 
normalized superposition of plane’ waves, in which case the integration by parts of 
the laplacian operator will be justified. Taking now into account the fact that 
(x) satisfies the Klein-Gordon equation, we find that the above expression 
vanishes. Hence 


a(k) =i [es et * Bye(x) 
(3-44) 
at(k) = -i [ex em = Bo (x) 


A basis in Fock space can be constructed from the normalized r-particle 
states : 


11/72 
r> = (r | aca |Fas-kdP) 


x | as ---dk, F(ks,...,k)at(ky)---at(k,)|0> 3-45) 


The wave function F(k;,...,k,) is symmetric in the interchange of the four- 
momenta k,, all belonging to the upper sheet of the hyperboloid k* = m?, the 
so-called mass shell. A number operator N commuting with P* is 


N= {ak at (k)a(k) (3-46) 


and, of course, acting on the states (3-45): 


Niner (3-47) 


It is crucial to verify that relativistic invariance has been maintained. The above formulation uses a 
particular frame, and we may wonder whether quantization in any other one obtained by a Poincaré 
transformation would lead to an equivalent theory. If primes refer to the coordinates in this new 
frame, we want to know whether there exists a quantum canonical transformation relating the 
fundamental creation and annihilation operators a and a‘ to the corresponding a’ and a’t obtained 
when quantizing on the hyperplane t’ = constant. This will be the case if to each transformation 
x-» x' = Ax + a we may associate a unitary operator U(a, A) in Fock space, such that 


U(a, A)p(x)U'(a, A) = (Ax + a) (3-48) 


Assuming differentiability in the parameters aand A itis sufficient to study infinitesimal transformations 
of the type 


x'* = x* + da",x” + dat 


with 6w,, + 6@,, = 0. Hence we want to find 10 hermitian Operators M*’, (M#” + M’*—0), and 
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P* such that 


i 
U=I- 5 6@yy»M*" + ida, P* (3-49) 
and i[5a,P* — 450,,M*”, o(x)] = 5a,0"p + 45e,,(x°d" — x*)p (3-50) 
The 10 generators P and M will have to fulfill the geometrical commutation rules 
[P*, Py]=0 
[M™, P*] = i(g"*P" — g”'Pr) (3-51) 


[ M+”, M’*) = i(g@mM’? —_ gM" = g*7M” a g’°M"7) 


where the commutator replaces (up to i) the classical Poisson bracket. As was already anticipated 
when deriving P“, we expect that these generators are given by Noether’s theorem 


PY = | ax @™ M” = | d>x (x#@° — x*@%) (3-52) 
t t 


with ©” given by Eq. (3-41), or more explicitly 
O° (x) = 4:(n? + (Vo)? + mo”): an 
©°4(x) = : ndig: ies 


The expressions for P* coincide with (3-38) and (3-39) and are indeed such that i[ P*, p(x)] = 0“@(x). 
As for M“’ we find 


Moj=i {at a'(k) ( saya 


2 a a 
M,=i {at a'to( igan ke <a 


We verify that Eqs. (3-50) and (3-51) are satisfied. Since M“” contains only an angular contribution 
the particles created and destroyed by the field g have no intrinsic spin. 


(3-54) 


Having satisfied ourselves with the covariance of the quantized theory we 
now want to explore the relationship between the description in terms of particles 
of mass m, spin zero, and the hermitian field @ which could belong to the set of 
measurable quantities. 

Even though at fixed time fields referring to various positions do commute, 
this is no longer the case when we compare them at different times. From canonical 
quantization, the commutator of two free fields y(x)and g(y) has, however, a simple 
structure as a c-number distribution which assumes the form 


[ote 0] = [dk [ere — eB) 


= iA(x — y) 
The real distribution A(x) can also be written with the help of the function 
e(u) = u/|u|: 


(3-55) 


i d*k 


r | aye 5(k2 — m?)e(k°) eo ** . (3-56) 


A= jy 
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showing that it is an odd, Lorentz invariant, solution of the Klem-Gordon 
equation. From A(O, x) = 0 and Lorentz invariance we find, in fact, that A(x) 
vanishes outside the light cone, i.e., in the region x” < 0. Measurements at space- 
like separated points do not interfere, a consequence of locality and causality. 
Note also that canonical quantization entails 


AoA(x)|x0=20 = —5°(x) (3-57) 


To construct coherent states which diagonalize the positive frequency part of the quantum field — 
the analogs of the minimal wave packets of the harmonic oscillator —we assume a normalizable func- 
tion n(k) to be given on the mass shell k? = m?, k° > 0: 


| akin < o 


Consider the state |y> given by 
\n> = exp | | a nar | |o> (3-58) 


which appears in Fock space as a coherent superposition of states with 0, 1, 2,... particles. It can 
be used as a generating function to obtain the states with a fixed number of particles if we allow y 
to remain arbitrary. Let us compute the norm of |7) using the identity e+ e? = e4* 8° (4-8)? valid 
for two operators A and B commuting with [A, B]. We find 


<M |n2> = exp | faz nt eat (3-59) 


Therefore two such states are in general not orthogonal and the set of coherent states is an over- 
complete one. If we denote |7) as the normalized state corresponding to |n): 


I apes 
m= exp | —5 | a near |in> 


we have 
(ns |n2) = exp | | (i Im ntn2 — 43] 01 — 2 | 


The vacuum state corresponds to 7 = 0, and more generally a(k)|y> = n(k) \n>, so that we have 
diagonalized the annihilation (positive frequency) part of the field 


g(x) = | at a(kye"™* — px) [n> = [a n(k) e~*"*|n> (3-60) 
Coherent states remain coherent under time evolution. Indeed, 


etn) = exp {| ak n(k) [e~'* at(k) era hioy 


= exp | | dk n(k) e~ ot a |O> (3-61) 


= Ine> 
where 7,(k) = e~ **! n(k). 
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It is also interesting to construct the unitarity operator D(n) which realizes the transformation 
D(n2) Ins) = Ins + n2) 


The reader may convince himself that 


D(n) = exp { - {a [n* (k)a(k) — noo} 


does the job. 
Since ¢(x) is an operator-valued distribution let us smear it with a test function 


2 ee [ats F(x)o(x) (3-62) 
When f approaches d(t)g(x) or 6'(t)g(x) we recover a space average of o(0,x) or x(0,x). We may 
study the probability distribution of gy, in a state corresponding to a fixed number r of particles 


pa) = <r|5(g; — a)|r> = | = en! Cyl 22%, |r) (3-63) 


18 


To compute this quantity we can use the coherent states as generating functions so the problem is 
reduced to the evaluation of 


2 
<ns| e* In2> = exp {ak nts + io(nt f +f *n2) — + \7 Pt (3-64) 
where f is the Fourier transform of f: 


fk = {ats e'*** f(x) 


Observe that only the mass shell components of f contribute to Eq. (3-64). Leaving the general 
case as an exercise to the reader, we derive the vacuum distribution of the field as 


d Hee 
po(a) = | = exp |- ina — > {ak | 700 | 


waar =1/2 a 
; ae: 3-6 
=|" [ae i7we | exp| 2 rato | ” 


Not surprisingly we obtain a gaussian distribution with mean square fluctuation 


o= | [ae mer] 


around a mean value zero. The field at a fixed space-time point corresponding to f= 1 has an 
infinite fluctuation and is therefore unobservable. But smeared operators are meaningful even when 
the function f has a support restricted to a fixed time. We note that the vacuum is not a zero field 
state and we shall soon show that vacuum fluctuations are observable. 

Instead of considering y'* (x) we may also wish to construct a complete set of tested values of 


(0, x): | 
Pn = [es (0, x)F ,(x) (3-66) 


corresponding to f,(x) = 6(x°)F,,(x) in the previous notation, with 


ae [es elk F(x) = F,(k) . (3-67) 
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normalized according to 
{at Fs(k) Fy-(k) = dn» 
F,(—k) = F*(k) (3-68) 
> Fk) Fs(k) = (272)?20,57(k — k’) 


An eigenstate of the operators g, with eigenvalues g,, may, by abuse of language, be called an 
eigenstate of the field @(0, x) with eigenvalue 


@.(X) = a Pn.c ja a FR(k) (3-69) 
such that 
[ex @-(X)F n(x) = One 


Let us denote by |g,> the corresponding state. To obtain its components in the Fock basis it is 
sufficient to know its scalar product with a coherent state |7 >. This is obtained by solving 


<n| On|Pe> = OncXN|@e> (3-70) 


for every value of n. If the normalization is such that 


dPn,e 
I = I : « c (3-71) 
{le II a” | 
we find that 
<n|@c> = exp \-4 D one + {a |- 3n*(k)n*(—k) + n*(k) > oncFut (3-72) 


from which we may study the physical content of these states. 


3-1-3 Charged Scalar Field 


The above description of the hermitian scalar field does not allow a distinction 
between particles and antiparticles. Particles and antiparticles have to carry some 
opposite-charge quantum number, whatever the nature of this charge may be. 
Classically, the minimal coupling prescription requires at least a complex field. 
In the quantum case let us therefore introduce a doublet of hermitian fields 
gy; and @2 represented by the complex quantity 


_ 91 + ip2 
< 


and its hermitian conjugate. The total lagrangian Y,,, will be the sum of two 
identical lagrangians of the form (3-7) pertaining to , and Q2: 


Prot = L(G1) + L(G2) = (0,¢)'(6"@) — moto (3-73) 


The quadratic Y is invariant under any rotation in the internal space characterized 
by the indices 1 and 2, or equivalently when 9 is multiplied by a phase and gi! 
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by the conjugate one. The invariance group is U(1) or O(2). Quantization is 
performed independently for ~, and ¢2, with the states labeled by the numbers 
‘ my and n> of quanta of type 1 or 2 corresponding to the operators 


N; = {ai al(k)a(k)  i=1,2 (3-74) 


In terms of the fields g and g' and the conjugate momenta 
Pi — ifr pie = ema 


/2 2 


L=O'= (3-75) 


the hamiltonian reads 
H= [ex :t'n + VotVo + moe: (3-76) 


Note the absence of factor 4 in Eqs. (3-73) and (3-76) as compared to the hermitian 
case. 
The commutation rules 


Lo(x), eV] =[e') @'WI=9 = Le), e')] = iA@—y) (3-77) 


reduce at equal times to 
[o(t, x), x(t, y)] = Let(t, x), x(t, y)] = i0°(x — y) 


The field itself may be written 


o(x) = {a (a(k) e~*** + bt(k) e***] 


'(x) = {ak [ b(k) poesia Gi) elk =) 


_ ay(k) + iar(k) 
= V2 (3-78) 
at(k = 0 = tall) 
= 42 
__ ay(k) — i(k) 
bik) = eee 
bt ky = 2100 + iaalk) 


ji 
[a(k), at(k’y] = [b(k), b*(k’)] = (22)°20,.63(k — k’) 


with all other commutators vanishing. 
We see that @ destroys quanta of type a and creates quanta of type b, and 
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vice versa for y', corresponding to the two observables 
Nw= {at at(kja(k) Np= | dk bt(k)b(k) (3-79) 


The choice of diagonalizing N, and N, rather than N, and N> will appear natural 
when discussing the charge below. It is reminiscent of the choice of circular 
polarization versus linear polarization of light. Of course, Na + Np = Ni + No. 
Finally, energy momentum is expressed as 


pt = | dk k*"[at(k)a(k) + bt(k)b(k)] (3-80) 


while the vacuum is annihilated by all operators a(k) and b(k). 
Applying Noether’s theorem to the U(1) invariance of the theory, we expect 
the existence of a conserved current 


j= sit 3,0: (3-81) 


and a corresponding time-independent charge 


Q= [ex :i(@'@ — ¢'Q): 
(3-82) 
= {at [at(k)a(k) — bt(k)b(k)] = Na— Np 


Indeed, we verify that 
Q=i[H,Q]=0 4,j*=0 (3-83) 


Therefore a quanta carry a charge +1 and b quanta a charge —1. The two 
types play symmetric roles and to distinguish them physically we have to introduce 
an external coupling. 

It has become traditional to identify one of the types with the particles, the 
other one with antiparticles. The symmetry of the theory requires m, = m, and 
also the equality of spins, here zero. The discrete symmetry under interchange of 
particles and antiparticles is called charge conjugation. 


Show that one can find a unitary operator @ in Fock space such that 


Go(x)6t=nep'(x) |n-|=1 (3-84) 


which commutes with the hamiltonian, and discuss its properties. 
Examples of distinct particle-antiparticle scalar pairs are the mesons n*x~, K*K~, or K°K®. 
In the last case the opposite “charge” is strangeness. 


The unified description given by the complex field @ which enters as a unique 
combination in charge conjugation-invariant couplings shows a profound relation- 
ship between the dynamical behavior of particles and antiparticles which goes 
beyond the equality of their kinematical invariants, as we shall see later. 
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3-1-4 Time-Ordered Product 


. When acting on a state the operator g'(x) creates a particle of charge +1 or 
destroys an antiparticle of charge — 1. In either case it adds a total charge +1. 
Similarly, if we act on a state with g(x’) we destroy a unit charge. We can interpret 
the combined action of the two operators, leaving the total charge invariant, 
in two ways according to the sign of rt’ — t. If t' > t we first create a particle at 
time t with g'(x) and destroy it at a later time t’ with g(x’). The corresponding 
amplitude will be given by the mean value of the operator 


A(t’ — the(t’, xo"(t, x) (3-85) 


When ?’ <1? an antiparticle has been created by ¢(x’), then absorbed by o'(x) 
at time t with a corresponding amplitude 


A(t — te (t, x(t’, x) (3-86) 


In both cases charge increases at x and decreases at x’ independently of the causal 
propagation. In the case t > 1’, instead of speaking of the creation of an antiparticle 
at x’ subsequently absorbed at x, we may say that a hole has appeared at x’ to be 
filled at the later time tr. We thus recover Dirac’s hole description of Chap. 2. 

The sum of the two operators (3-85) and (3-86) is Dyson’s time-ordered 
product 


T(x'o"(x) = A(t’ — e(x')o"(x) + Ot — Keel’) (3-87) 


appropriately named since operators occur under the T symbol arranged from 
right to left with increasing times. Bosonic operators obviously commute under 
chronological ordering. 

Let us act with the operator (1, +m’ on the time-ordered product (3-87). 
Care should be taken since the step functions are time dependent, therefore 
instead of finding zero we must obtain a distribution concentrated at equal times. 
Indeed, 


2 af.é 
<3 Tool) = = 7 < ox )o"(x) + 4 — 0LaC, orca} 


cane 5 ee "| 
= T aaa Ox) (x) + OE a) ap PO DP 


where we have used the fact that 
6(t' — t)[o(x), o'(x)] = i5(’ — HAO, x’ — x) = 0 
Now, from (3-77) and (3-57), 


ir — 1) G (x) oa os 1) = Nex Se = >) 
and using (62/dt’?)(x’) = (Ax — m”)@(x’), we find 


(aya m*)T¢(x')o"(x) = —id*(x’ — x) (3-88) 
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If instead of fulfilling the homogeneous Klein-Gordon equation the field @ would 
satisfy (1) + m?)(x) = j(x), Eq. (3-88) would be replaced by 


(Dy + m?)Te(x')o'(x) = Tj(x’)e"(x) — 16%(x' — x) (3-89) 
It follows that the vacuum expectation value 
G,(x’ — x) = iX0| Te(x’)et(x)|0> (3-90) 


is one of the Green functions of the Klein-Gordon operator. A little calculation 
shows that it is indeed the Feynman scalar propagator encountered in Chap. 1: 


d*k 1 
G(x — y) = -| See 


We conclude that the quantization of a free, relativistic scalar field has produced 
a satisfactory description of spinless noninteracting particles, obeying the Bose- 
Einstein statistics, with or without charge. In the absence of interactions the 
number of field quanta is conserved. 


eth») (3-91) 


3-1-5 Thermodynamic Equilibrium 


In a rest frame with total three-momentum zero it may happen that the correct description of the 
above quantized system, instead of being the vacuum state, is a thermodynamic equilibrium at 
temperature T = 1/kB (k is Boltzmann’s constant) and chemical potential y. This is well defined as 
long as the total number N of quanta is conserved. We return to a fixed volume quantization with 
discrete momenta : 


2n 
= yi" 


ak 1 
>—2 
lo ais 


We rescale the annihilation and creation operators by defining 


k Ny, N2, n3 integers 


whereupon integrals are replaced by sums 


a(k) = ./2a,V Ay 
in such a way that 


[Au, AL] = oyu 


with Kronecker symbols instead of delta functions. We have therefore 
H= {at ay,a'(k)a(k) = ya, AL Ay 
N= a at(k)a(k) = Z, ALA, 


To simplify we have assumed the system to be neutral. The grand partition function % is given by 
a trace in Fock space 


Fm tee et ™ = Tr exp[ =f (a, — p)ALA,] 
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Each mode contributes a factorized term, and since Tr e*414 = (1 — e4)~ 
& = exp[—2Xy In (1 — e Ml )] 


In the large volume limit the thermodynamical potential is 
d®k 

In (1 — eH) 

(3-92) 


Q= log 2 = -v [= : 


From thermodynamics we know that if p is the pressure Q = fpV. Therefore 
lia?k 
in(l— e-",—) 


Pp = 
~ BJ nye 
while the mean values of the energy density E/V (without zero point oscillations) and particle density 
N/V are 
E - 6Q d°k Or, 
ecu JO SPS 1 (3-93) 
N _ 1 @Q d*k 1 
Ve OBV ule (Qn)? eh — 4 
the familiar expressions corresponding to Bose’s statistics. In the case m = 0, and when the number of 
particles is not conserved (that is, 4 = 0), we have a situation analogous with blackbody radiation 
except for polarization. The energy density becomes 
E ak {k| 4n ke d 
— = | — = dk — —In(1—e* 
(2n)> eFlkl — 1 oa |, B dk ) 
(3-94) 


V 
In (1 — e~Flkl) = 3p 


Integrating by parts we obtain 
E Bf ak 
Vo BJ Qn? 
a classical relationship, familiar from blackbody radiation, stating that the average energy momentum 


E 
—-—3p=0 


6, = 


Denoting by parentheses thermodynamic averages (we assume now pt = 0 but keep the mass 


“aI 


tensor ©,, which has only diagonal elements @o9 = E/V, ©;; = pdj; (i, j = 1, 2, 3) is traceless 


Z, exp { — BE,} <a| Ala> 


y= > exp (— BEa) 


arbitrary) 


let us study the propagation at finite temperature. Set 
,o=- 
we gle 4 


Then 
(ALA) = Onn f(x) 

(Ay Au’) = One [1 + f(@x)] = — oun f( @y) 

(Ax Au’) = (ALAL) = 


In the infinite volume limit 
(9()e()) | a {e~* TT + F(a,)] +e & flow} 
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and the time-ordered product takes the form 


dtk enh eon a ae - 
(Tain =i feet |e el 
T = % 


This expression exhibits the propagator at finite temperature as the sum of the zero temperature 
Feynman propagator plus a temperature-dependent solution of the homogeneous Klein-Gordon 
equation which vanishes like e~’" when f > «(T 0). This propagator is interesting to study 
dynamical disturbances away from equilibrium. 

The structure of the partition function suggests that we may encounter, in applications of 
perturbation theory to statistical systems, a different, “Euclidean,” propagator. Starting from the basic 
dynamical field variables at fixed time, we define an evolution for imaginary times 


xo = —iXo 
where Xp is real, and at first restricted to the interval [0, 8] according to 


(x) = e4% @(0, x) e~ Ho = eH%o gO, x) e~ H¥ 
Operator ordering with respect to Xo will be denoted by the symbol 7 and will satisfy 


2 
e — A. + nm? \[Totooor Bee o fo) a 
Xo 


a 
It can be continued as a periodic function of Xo — Yo of period f. Indeed, assume at first fp restricted 
to the interval [0, 8]. Then from the cyclic character of the trace 


[T o(x)@\)]2,=0 = (Tr e~ 84)" ! Tr [e~*# p(y) (0, x)] 
= (Tr e~ 9*)-? Tr [e~4# ef (0, x) e F* @(y)] 


= [Toe] 2-6 


When this continuation is performed the 6(Xo — yo) function is to be understood as applied to periodic 
functions (functions on a circle) with 276(a) = 2, e'". We can then write 


vi 3 F 
[Tee] =5 >, exe | -2in FG = || ak gik-(x-y)___! 


n=—-0 B (2n)* Ke oe m? B 96) 
3 . i 
a iF k eh (xy) : 
B 4.0 —dinnjp J (27)° k? — m? 
In the last expression use has been made of minkowskian notations with x° = —i<° pure imaginary 


as well as k®. If we denote by the symbol f' d*k/(27)* the combined discrete sum and integral 


1 |e 
B 40 ~7ninjp J (22)? 


the structure of (3-96) is formally identical to the ordinary propagator. We observe, however, that the 


denominator 
DnNe 
k? — m? = -((F) ke +m] 


never vanishes. The reader should realize that (3-95) and (3-96) represent very different quantities 
relative to unrelated problems. 


QUANTIZATION—FREE FIELDS 127 


3-2 QUANTIZED RADIATION FIELD 


‘The above thermodynamical example leads us at once to the radiation field. 
Indeed, Planck’s blackbody spectrum was historically the first instance of field 
quantization. We have simply to generalize the preceding construction to the 
degrees of freedom described by the potential A,,(x). Since we insist upon the 
local character of the theory, we cannot make use of the gauge-invariant tensor 
F’,, as the fundamental dynamical variable. We have to face the gauge dependence 
of A, already discussed at length in Chap. |. Various devices have been used to 
circumvent this difficulty. We shall not try to quote all of them here and present 
the Gupta-Bleuler indefinite metric quantization. 


3-2-1 Indefinite Metric 
We recall that the lagrangian 
L = —4F? (3-97) 


where F is expressed in terms of A, is unsuited for canonical quantization since 
the conjugate momentum of Apo vanishes. If we were not to worry about manifest 
Lorentz covariance we could content ourselves with A by constraining A° and using 
a condition of the type div A= 0. This of course is physically reasonable, but 
does not fit our goals. We therefore use the procedure of modifying the equations 
of motion at the price of restoring Maxwell’s theory by appropriate constraints 
on the physical states, since we shall introduce spurious degrees of freedom. The 
price will seem at first rather heavy, for not only will the Hilbert space of 
quantization be too large but instead of carrying a positive metric it shall allow 
for negative norm states! Thus the usual probabilistic interpretation of quantum 
mechanics will only emerge when we will have restricted ourselves to the physical 
quanta: the photons with only two polarizations. We also recall that to preserve 
gauge invariance, the principle which justifies the above procedure, we will have 
to make sure that the current remains conserved when introducing interactions. 
We therefore modify (3-97) to read, with an arbitrary constant 4 # 0, 


L= —<F? -$ (0-4) (3-98) 
so that Maxwell’s equations are replaced by 
DA, — (1 — A)d,(8° A) = 0 (3-99) 
and the conjugate momenta to the four components of A are 
BRA 
= — FR _ J g°°(d- A) (3-100) 
(AoA) 


In particular x° no longer vanishes. 
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Whether the right-hand side of Eq. (3-99) vanishes, as it does in this free case, 
or not if we replace it by a conserved current, we find that 


Oa: 4) =0 (3-101) 


so that 0: A is a free scalar field. Classically we argued that appropriate con- 
ditions could be imposed on the boundary data in order that d-A vanishes 
everywhere as a consequence of Eq. (3-101), hence restoring Maxwell’s theory 
in the Lorentz gauge, 0: A = 0. This cannot be achieved in the quantum case. 
Indeed, we shall assume the canonical commutation rules 


[A,{t, x) 2°(t, y)] = ig, 0x — y) (3-102) 


The operator equation @- A = 0 is then inconsistent, since up to a constant factor 
this is 2°, and the latter has nonvanishing commutators with A® at equal times. 
We see at once that quantization according to (3-102) has overdone its job by 
adding unphysical features which will have to be eliminated. Even though all 
calculations can be pursued, keeping A arbitrary, in the interest of simplicity we 
shall henceforth set 2 = 1, in which case Eq. (3-99) reduces to 


mie 0 ee (3-103) 


In the quantum case the choice of / is called by abuse of language a choice of 
gauge, and A = 1 is referred to as the Feynman gauge. 


The price for simplicity is that we will not directly verify that our results are 4 independent. Within 
the framework of the free field we leave it as an exercise to the reader. He or she may as well dis- 
cuss alternative forms of (3-98), where the added term might be taken as (A/2)[0- A — @(x)]? with 
(x) a c-number classical field or (A/2)[n(x)* A]? and n(x) a classical vector field (perhaps constant). 


The commutation rules (3-102) are completed by requiring that 


[A,(t, x), A,(t, y)] = [7,(t, x), 2,(t, y)] = 0 (3-104) 


From (3-104) it follows that the spatial derivatives of A, commute at equal times 
so that (3-102) and (3-104) can be recast in the following form: 


[A,(t, x), A.(t, y)] = [A,(¢,»), Av(t, y)] = 0 
[A,(t, x), Av(t, y)] = ig pv O-(X — y) 


analogous to the scalar case repeated four times. Corresponding to the reality of 
the classical potential components we assume their quantized counterparts to be 
hermitian. 

The dynamical system is therefore specified by (3-103) and (3-105). We note, 
however, a slight difference as compared to the scalar case, where the nontrivial 
commutator was 


(3-105) 


[ p(t, x), A(t, y)] = id°(x — y) (3-106) 


Only three out of the four components of A,(t, x) behave as in Eq. (3-106). The 
commutator involving Ao(t, x) has a reversed sign, as if the role of the field and its 
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conjugate variable were interchanged. This is related with the four-vector char- 
acter of A, and generates, as we shall soon realize, a very different Hilbert space 
' when we “solve” (3-105) as we did in the preceding section. 

In order to fulfill this program, we Fourier analyze the field in terms of plane 
wave solutions of Eq. (3-103) and set 


3 
A,(x) = [at Y Law Pky em ** + at kye*(k) e**] (3-107) 
A=0 


with, as in (3-35), 


ko = |k| (3-108) 


For each k on the positive light cone, e{(k) are a set of four linearly independent 
vectors which we may assume real. The symbol | refers to an hermitian conjugation 
with respect to a nondegenerate scalar product in Fock space. As indicated in 
(3-108) the field quanta are massless. 

In order to exhibit the content of (3-107) we shall make a definite choice of 
polarization vectors e”(k) with respect to a fixed reference frame. Let n stand for 
the time axis n? = 1, n° > 0. We choose e‘!) and ¢% in the plane orthogonal to 
k and n such that 


eMk)-e@(k)=—d,y AA =1,2 
We then pick ¢'* in the plane (k, n) orthogonal to n and normalized 
eNk)en=0 [ek]? = -1 


Finally we take e equal to n. With these conventions we call respectively 
and ¢') transverse, e'°) longitudinal, and e°) scalar polarizations. In the frame 
where n° = 1 and k is along the third axis, we have simply 


‘1 0 0 0 
0 1 0 0 
(0) _ a (re (3) = 
aie ||, el é 1 é 0 
\0) 0 0 1 
In any case we have 
ey (k)ey*(k) e(k) + 62*(k) = 9 (3-109) 


x BCR ae) — 


The denominator in the first expression is required because of the indefinite scalar 
product. From (3-107), A, automatically satisfies the field equations. The com- 
mutation relations (3-105) follow provided 


[a (k), at (k’)] = —g*?'2k°(2n)°5°(k — k’) (3-110) 
Furthermore, we readily find at arbitrary times 
[A,(x), A,(y)] =a 1g py» A(x an y) ; (3-111) 
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with A defined as in (3-56) for m = 0. We feel secure as we see covariant expressions 
such as (3-111) emerge naturally. We would then proceed by introducing a vacuum 
state annihilated by the various operators a‘”)(k): 


a(k)|0>=0 A=0,1,2,3 


But somehow we feel uneasy, as we have seemingly introduced twice as many 
polarization states as are usually attributed to photons. The trouble is explicit 
when we construct a one-particle state with scalar polarization : 


I> = [a f(a "(k) [0D 


Let us compute 
G11 1 ex | fat dk’ f(k)*f(k’) <0] a (kya *(k’) 10> 
According to (3-110) we find the paradoxical result that 
<1]1> = —<0|0) aos 


Fock space has an indefinite metric! The same calculation repeated with the 
three other types of polarization leads to states with positive metric. What about 
the probabilistic interpretation of quantum mechanics ? 

As long as we discuss the free case, the situation is not as catastrophic as it 
may seem, but interactions may excite the unwanted states. The crucial observation 
is that up to now we are not really dealing with Maxwell’s theory since we have 
explicitly modified the lagrangian. To recover it, we would like to set 0- A = 0, 
but this is impossible as an operator equation. We may therefore at least try to 
select the states ly> such that the Lorentz condition holds in the mean 


<W|d-AlW> =0 (3-112) 


To preserve the linear structure of the physical Hilbert space #,, we require 
that the annihilation (positive frequency) part of 0- A annihilates #7, : 


AL hy? —0 (3-113) 

Clearly (3-112) will follow from (3-113). 
Let us now analyze the states belonging to #,. Since condition (3-113) is 
linear, we can consider basis states obtained by acting on the vacuum with 


(smeared) products of creation operators pertaining to the various polarizations. 
We therefore factorize these states as 


lW> = lbr>|o> (3-114) 


where |i» corresponds to transverse photons and |\p> is obtained by acting on 
the vacuum with longitudinal and scalar operators. This decomposition depends 
on our conventions for the polarization vectors. It is clear that it is sufficient to 
examine the consequence of condition (3-113) on such states as |) since 0+ A 
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only involves longitudinal and scalar polarizations: 


id> A) = | a ee SY a MkeM(k)-k (3-115) 
A=0,3 
so that (3-113) reduces to 
Yk eka) |o>=0 (3-116) 
A=0,3 


It would be too much to expect that (3-116) could determine lp> entirely. Further- 
more, we must allow for the arbitrariness in the choice of the transverse polarization 
vectors, to which we may add at will a term proportional to k. This arbitrariness 
must be reflected in the |¢> part of the state so that we may at most expect that a 
given physical situation is represented by a class of equivalent vectors in #,. 
We want, of course, to verify that vectors in this subspace have positive square 
norm. 
An equivalent form of (3-116) is 


[a(k) — a(k)] |p> = 0 (3-117) 


A state |p> may be a linear combination of states corresponding to 0, 1, 2,... 
scalar or longitudinal “photons:” 


|O> = Colbo> + C1|o1> +7 + Caled +> 
lfo» = |0> 


The number operator for these photons is, by virtue of (3-110) (note the minus 
sign), 


(3-118) 


N= | dk fa (kya (k) — a (kak) ] (3-119) 
A state |f,> such that (3-117) is satisfied will be such that 
nn | Pn> = 0 
Hence 
<bn| bn» = 6n,0 


If n ~ 0 such a state therefore has a zero norm. A general solution is of the form 
(3-118) with 

<$|¢>=|Co|? = 0 
the coefficients C; remaining arbitrary and each of the |@,,> fulfilling (3-117). It 
remains to show that the arbitrariness in |f> does not affect the physical 
observables. Consider the energy, for instance. The hamiltonian reads 


ve | @Pxintd, — 2: =5 | dx: D° [AP + (VA)?] — AB - (WA): 
- (3-120) 
3 
i {at Or b a "(kya (k) — erina | 
: . 
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When acting with H on a state in #, we find in general a contribution from 
the unphysical part |p >, but it vanishes when we take the mean value since 


Cal | dion Yd bd'\br) 
p 


Cul W> _ | 
<y|y> , (br| vr? 


by virtue of (3-117). As far as momentum is concerned, the formula is analogous 
with @, replaced by k. Insofar as only average values are observed, we see that 
not only have negative probabilities disappeared when restricting ourselves to # 
but also scalar and longitudinal photons do not contribute. Only the two physical 
transverse polarization states manifest themselves. 


(3-121) 


It is nice to realize that the arbitrariness on |¢> is reflected by the fact that the mean value of A, 
in such a state is a pure gradient. Let us compute it as an exercise. We readily derive from (3-118) that 


«| Ayulx)|o> = C$C1 <ho| [kena [el (k) ak) + e!(k)a'(k)] 161 > + complex conjugate (cc) 


All other components |, > do not contribute since A, changes n by a unit; 
to |0>, and |¢1 > is necessarily of the form 


do» can be taken equal 


|o1> = {aa F(a) [a™*(q) — a*(q)] |O> 


Choosing real polarization vectors we have 
<$| A,(x)|6> = [a [el (k) + ef '(k)] [C8C, e~ *® =F (k) + cc] 


We recall that ek) = n and by its definition 


ak n(k +n) 


(3) 
ear 


Therefore, 


1 
ek) + 1K) = a ky 


Hence, provided the integral is meaningful, 
dk 


<4 A,0010> = a |e 


[iC$C, e~***f(k) + cc] = 0, A(x) (3-122) 


where the scalar c-number function A(x) is a solution of MA(x) = 0, and may be chosen at will by 
adjusting the vector |>. This is another proof that the arbitrariness of \p> reflects the gauge 
arbitrariness of A, without practical consequence. 


We may select in the equivalence class of state vectors of the form lWr>|> 
a convenient representative by imposing that |f> = |0), this again with reference 
to a choice of basis polarization vectors. Instead of taking the transverse ones, 
real and orthogonal, we frequently choose the complex combinations (e"” + ie, /2 
to represent the two helicities of the photon. 
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We shall not elaborate in detail the proof of Lorentz invariance. It is clear that #, is Lorentz 
invariant as well as the equivalence class representing a given physical state. The reader is invited 
‘ to construct explicitly the generators M“” of Lorentz transformations. He or she will verify that photons 
are not only massless but carry an helicity +1. It is sometimes said that the photon has spin 1. 


In spite of the fact that only vectors in .#, receive a physical interpretation 
it is worth emphasizing that the full indefinite norm Fock space is necessary to 


preserve the locality properties. They appear generally in complete sums over 
intermediate states. 


AS an exercise one can easily extend the discussion of Sec. 3-1-5 to recover the blackbody spectrum 
within the present formalism. 


3-2-2 Propagator 


We expect the propagator to be defined, as in the scalar case, by the vacuum 
expectation value of the time-ordered product. Up to now we have dealt with the 
Feynman gauge. Time ordering is performed as in Eq. (3-87) by multiplying 
products of fields by step functions 


TA,(x)A,(y) = A(x°® a y°)A, (x) Ay(y) sp Ay® a x°)A,(y)A,(x) (3-123) 


This, however, may sometimes be too naive. This expression is not a priori defined 
at equal times. Fortunately, we do not encounter this difficulty in the present 
case. Inserting the expansion (3-107) and using the commutators (3-110) we find 


d*k e” ik «(x—y) 


eerie 3-124 
(Qn)* k? + ie ae 


<0| TA,(x)A,(y) lo> = iG uvGr(x - y) ao = SS 10ns 


Each component of the field behaves independently and we note a change in sign 
between the spatial and time components. Explicitly, when m = 0 the Feynman 
propagator in x space reads 
4 =H s08% 1 1 
Gr(x) = a Ee | (3-125) 


The relation of these formulas with the measuring process of electromagnetic 
fields is discussed in the classical papers of Bohr and Rosenfeld. 


Let us make a final comment concerning an arbitrary choice of the parameter 4 in Eq. (3-98). From 


it we derive the field equations (3-99) and equal-time commutators with a more complex structure: 


[A,(t, x), A,(t, y)] = 0 
—A 
[A,(e, x), A(t, y)] = in(1 am — sno) =) 


[Ai(t, x); A(t, y)] = [Aolt, x), Aolt, y)] = 0 


(3-126) 


A 


a 
[Ao(t, x), Ai(t, y)] =i axl 6°(x — y) 
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The different components are therefore coupled. If we write the classical solutions of (3-99) as 


d*k 2 


we get the homogeneous matrix condition 
[kg + (1 — A)k*k,] AX(k) = 0 (3-127) 


In order to find nontrivial solutions it is necessary that the determinant 4(k*)* vanishes, which implies 
that A,(k) has its support on the cone k? = 0. It cannot, however, be of the simple form a,(k)O(k?); 
this would imply that A,(x) satisfies the homogeneous Klein-Gordon equation, which is not the case. 
Necessarily, A, (x) involves 5(k?) and 6'(k?). An equivalent way of looking at this is to observe that 
for nonvanishing k? we have 

I 1-A k@k 


eas ee (3-128) 


[kr —(1-Ak® k\~? = ke (k2)? 


We have left to the reader the task of going through the details of the construction of the indefinite 
metric space of states in this case. Let us, however, mention the construction of the propagator. We 
have with the naive definition 


[O.g", — (1 — )6*d,]<0| TA*(x)A”(y) |0> = <O| TL[O.g", — (1 — A)0*A,] A*(x)A"(y) |0> 
+ 5(x° — y)<0| [4%x), Ay] |O> (3-129) 
— (1 — A)gt°d(x° — y°)<0|[6,4%(x), AX] [O> 


Using the field equations and equal-time commutators we observe that noncovariant terms conspire 
to disappear, and we are left with 


[O.9", — (1 — A)d“0,]<0| TA°(x)A(y) |0> = igt”d*(x — y) (3-130) 


Therefore, using (3-128) and Feynman’s ie prescription, 


4 wy — HY 
<0| TA*(x)A*(y) |0> = -i| alls z be 4 Jee | (3-131) 


= kite) a 
(2n)* k? + ie A (k? + ie)? 


It is not obvious that the denominator in the second term reads (k? + ie)? and not (k?)? + ie, for 
instance. This could only be checked by a complete calculation, but it will soon become clear when 
we study the massive vector case. 

When 4 = 1 we recover the Feynman gauge. When j > 0 we see the type of singularity which 
arises. The limiting 4 — 00 case is called the Landau gauge. Physical results should not be affected 
by the value of 4. If, for instance, we compute an integral of the type 


| dy (0 TA*(x)A") 04,09) 


with a smooth conserved current we can easily check that it is indeed independent of the value of A. 


3-2-3 Massive Vector Field 


According to Maxwell’s theory photons are massless or, equivalently, electro- 
magnetic forces are of infinite range. This vanishing mass is at the origin of the 
infrared catastrophe, the emission of infinitely many soft photons whenever a 
charged particle is accelerated. Let us investigate how a small mass would affect 
this behavior. Kinematically, a new (longitudinal) polarization state would be 
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present. Couplings should therefore be such that, in the limit where the mass py 
goes to zero, the ratio of the couplings of longitudinal to transverse modes should 
vanish. 


Classical spin 1 massive particles may be described by the Proca equations 
for the four-vector field A,(x): 


Ce A — 0 F°* = Gg? AY — 0’A® (3-132) 
Taking the divergence of this equation we find 
p?d-A=0 (3-133) 
For yp” # 0, A is divergenceless and Eq. (3-132) reduces to 
(O+p)A,=0 @-A=0 (3-134) 


The vanishing of @- A ensures that one of the four degrees of freedom of A, is 
eliminated in a covariant way, so that the field quanta will indeed carry spin 1. 
The lagrangian leading to (3-132) is easily recognized as 


G=—-hF? 4424 (3-135) 


Note the plus sign in front of the mass term to be compared to the analogous 
—(m?*/2)g? term in the scalar case (3-7). This is easily recalled if we consider 
that space-like components, contributing negatively to the invariant square, 
represent the physical degrees of freedom. 

After quantization the field A,(x) will have the following expansion: 


A(x) = {a » [a'(k)e(k) ek x 4 glAt(ke*(k) elk *] 


ko = Jk? +? 
4=1,2,3 


fa(k), a*"(k’)] = 54 ,:2k°(22)?53(k — k’) (3-136) 


The three space-like orthonormalized vectors e')(k) are simultaneously orthogonal 
to the time-like vector k, so that if we assume them real 


Kok 
e(k)- ek) = 03! yy eA (kel (k) = ee @ a te) 


The naive construction of a time-ordered product leads to a noncovariant 
propagator 


a —koky/e? 2 
<0|TA,(x)A,(y) io = -i| ak ea ik- (&—y) Ipv — Koky/ ee aa 5,05y00%(x — y) 


(Cx k? — pw? + ig 
(3-137) 
while the commutator equals 


*k k ky 
[A,(x), Ay] = — \ae p Oe u2)e on ik (ey) @ = 2 ) (3-138) 
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The noncovariant contribution to the time-ordered product is, however, a localized 
distribution at coinciding points. We have already noticed that time ordering left 
such a distribution undefined. We can therefore define a covariant propagator 
by simply omitting these terms without affecting its properties. We shall give a 
more thorough discussion of such phenomena in Chap. 5. 

If we were not interested in describing the limiting zero-mass case the above 
description of massive spin 1 particles would be perfectly sufficient. It uses a 
positive norm Fock space with only physical states. But we encounter singularities 
as we try to set u? = 0. This is reflected in the fact that the limiting lagrangian 
—4F? is unsuitable without a gauge prescription. We seek for an equivalent 
description admitting the correct zero-mass limit. 

The trick is to introduce, very much as we did for the electromagnetic field, 
an indefinite metric Fock space with an auxiliary condition. We study therefore 
Stueckelberg’s lagrangian 


L = —4ZF? + Sy? A? — $10: A)? (3-139) 


in which for 2 # 0 the limit u” — 0 is not singular. To recover a gauge-invariant 
theory in the limiting case it's essential that the field A, be coupled to a conserved 
current. 

From (3-139) the variational equations read 


(CO + p?)A? — (1 — A)a2(d- A) = 0 (3-140) 


If we take the divergence of both sides we now get 
{co a: Ele A=0 (3-141) 


which remains valid if on the right-hand side of (3-140) we have a conserved 
current instead of zero. 
For 4 #0, 6:A is a scalar field satisfying a Klein-Gordon equation with 


square mass m? = pu*/A(m? could be negative; however we shall assume 2 to be 
positive for this not to ee The field 


A 
A, = Ap+—3 3 0,0°A= oo cee (3-142) 
is divergenceless as a consequence of (3-141) 
oA, — 0 (3-143) 
The corresponding splitting of A into a “transverse” (spin 1) and “scalar” part is 
A 
A, = Al — me 0,(0- A) (3-144) 


Canonical quantization introduces four creation and annihilation operators 
satisfying 
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[a(k), a? (k’)] = 6,,(2n)?2 /k? + rok —k) 1 <41' <3 
fa(k), a t(k’)] = —(2n)32 ./k? + m? d(k — k’) 


and all other commutators vanish. The minus sign in the last commutator (3-145) 


is a signal of the indefinite metric case. Two different masses occur in the vector 
and scalar case. The field itself has components on the two hyperboloids: 


(3-145) 


a?k 3 | 
A,(x) = = Y Lal (k)e(k) e «x ae at (k)e*(k) er] 
2(2my> Je? + pe ak , 
a°k kp (0) -ik+x (0)t ik x 
a sem? Jz amt w (k)e + a '"(k) e ] 
T m 


(3-146) 


As before e(k), for A = 1,2, 3 are three orthonormal space-like vectors orthogonal 
to k(k? = 1”). We may easily verify that Eqs. (3-143) and (3-140) are satisfied by 
this expression while the corresponding covariant propagator is 


O| TA >= ak; =nhe — kk,/u? k ky /p? 
X)Ay =--i ik+(x—y)[ Ze pv, piv 
< | ol ) (y)| > i( ye (5 2 creak (2 m2 L ig 
(3-147) 


2 
2_H 
m = — 

A 


and the commutator reads 


d®k eae. yee, k pky 
[A,(x), A(y)] = -|e% 1 Es (gre? = e**' »( an = 


m i. /\? +? 
+ E (ay) >= gunn) te = (3-148) 
2ko wr oe Seem 


The reader is invited to study the limits » + 0 and 4-0. 


(a) When 4 0, » # 0, show that we recover Proca’s theory. 
(b) If A# O and yp — 0, m goes to zero. Show that in this limit the Green functions tend to the values 
given in the preceding subsection. 


In particular, since 


Gov — Kokvlt? | Koko” __ ov ean ze et (3-149) 
ea ptie ke —m tie M—prt+ie A (k?— p?/A+ ie)(k? — pe + ie) 


We see that the double-pole term tends to 1/(k? + ie)? as anticipated in Eq. (3-131). We shall return 
to this zero-mass limit in the following, in particular in Chap. 4. It should also be noticed that in 
k space the propagators in the Stueckelberg formalism behave as 1/k? for large k, while this is not 
the case for the massive Proca propagator (3-137). As a supplementary exercise one may check the 
covariance of the theory and construct the generators P, and M,, of the Poincaré group. We have 
not offered here a very thorough description of the kinematical properties of photons. On this point 
the reader is referred to the works quoted in the notes. Rather, we had in mind to pave the way to 
recent developments in gauge theories, to be considered in Chap. 12. 

We close this subsection by describing the principle of a method allowing us to set an upper 
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limit to the photon mass using terrestrial measurements. This is the so-called constant field method of 
Schrédinger. Assume as a first approximation that the earth can be simulated by a perfect magnetic 
point dipole M. The corresponding localized conserved current is such that 


M= | xx xin 


and can be written j = —4M x V6°(x). In the static case, let us derive from Proca’s equations the 
corresponding vector potential such that div A = 0, a condition compatible with these equations. 
It follows that 


a 2 es 2 ee d°k kx xk 
(—A + p’)A =j= —2M x V6>(x) = slap 


DATE ens (—) 
ape Mxk—— = -MxV 
5 {os ne + Snr 


The corresponding magnetic induction is 


a solution of which is 


eu e7HF 


B = curl A= — V x (M x Y) = [(M:V)V — MA] 


Sur Sxr 


Let us take the z axis along M with unit vector 2, so that B has the form 


=H) > 
<_< |M| {LF 2) — 42](u?r? + Bur + 3) — 32,777} 


8xr3 


B 


The field has been split into two parts. In the first [#(?- 2) — 42] has been factored out, corresponding 
to the angular distribution of the magnetic field in the limit 1-0. At constant distance r there 
appears a uniform added term antiparallel to the earth’s dipole. A careful scan of the angular 
distribution of the magnetic field allows us to set an upper limit to the mass yp of the order of 


4x 107*8 9 (3 x 10715 eV ~ 10° '° cm™!). More recent measurements using a different method have 
improved this result by an order of magnitude. 


3-2-4 Vacuum Fluctuations 


It is instructive to study simple effects arising from the field quantization before 
embarking on the coupled nonlinear problems. Especially interesting are those 
which do not seem to rely on the particle interpretation—in the present case, 
the photon aspect. We give a schematic description of two such situations which 
have to do with the observability of differences in vacuum fluctuations. We have 
encountered such phenomena when presenting Welton’s interpretation of the 
Lamb shift in Chap. 2. 

We can take into account simple macroscopic sources by modifying boundary 
conditions on the field which was considered up to now in free space. This pro- 
cedure is to some extent unsatisfactory since it does not describe the microscopic 
mechanism responsible for these boundary conditions. But it is suitable for elemen- 
tary calculations. The original observation of Casimir (1948) is that, in the vacuum, 
the electromagnetic field does not really vanish but rather fluctuates (compare 
Sec. 3-1-2). If we introduce macroscopic bodies—even uncharged—some work 
will be necessary to enforce appropriate boundary conditions. Intuition on the 
sign of this effect is lacking, so that work here is meant in some algebraic sense, 
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meaning the difference in zero point energies between the two configurations. 
We had originally disregarded the (infinite) contribution 2 4hw, to the hamiltonian 
‘by arguing that it was unobservable. However, its variation can be measured. 

Let us illustrate this point for the simple configuration of two large parallel 
perfectly conducting plates as considered first by Casimir. Of course, we can study 
different geometries and different materials with similar results (except perhaps for 
crucial signs). We idealize the plates by two large parallel squares of size L at a 
distance a (see Fig. 3-1) with a « L. Consider the energy per unit surface of the 
conductor with respect to the vacuum. Its derivative will be a force per unit surface 
with dimension ML~'T~? (where M is mass, L length, and T time). The only 
quantities entering the problem are h,c and the separation a (the boundary 
conditions E perpendicular and B parallel to the plate at the interface do not 
introduce any dimensional quantity). Of course, the effect is proportional to h, 
as is the zero point energy. The force per unit surface is therefore proportional 
to hc/a*, the only quantity with the required dimension. We shall see that it is 
attractive. 

Consider the modes inside the volume L*a, where L »a and we ignore the 
edge contributions. As we know, only transverse modes contribute to the energy. 
If the component k, perpendicular to the plates is different from zero it can only 
take discrete values k, = nz/a (n = 1,2,...) to allow for the nodes on the plates 
and there are two polarization states. If, however, k, vanishes only one mode 
survives so that the zero point energy of the configuration is 


h dk, 


oie ae | 2 eae te)" 
=)'2 Raa a5 Dai l=> (2x)? | ll rama Il az 


Figure 3-1 Casimir effect between two parallel plates. 
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As it stands this expression is, of course, meaningless, being infinite. But we 
must subtract the free value which contributes in this same volume a quantity 


Aap? 
Bo = | aoe a > pmich 


(2n)* 2n 
ae ve (He : \, dn 2./k3 + n?n?/a? 


Therefore the energy per unit surface is 


E-—E, ic (| k ie oo + 
OS 2 oe [ k dk G - ae k? + n?x?/a? — { dn /k? + mia | 


This quantity is apparently still not defined due to ultraviolet (large k) divergences. 
However, for wavelengths shorter than the atomic size it is unrealistic to use a 
perfect conductor approximation. Let us therefore introduce in the above integral 
a smooth cutoff function f(k) equal to unity for k < k,, and vanishing for k >k,, 
where k,, is of the order of the inverse atomic size. Set uw = a?k?’m; then 


é= hems | iu S9(% Vi) +S ner (Ear 
[arr] 


= oan Ee SR) i) I. dn ro 
a 0 


Here we have defined 
Fin)= | iu Ju f(2 Jute) 
0 
The interchange of sums and integrals was justified due to the absolute con- 
vergence in the presence of the cutoff function. As n> oo, F(n)>0. We can 


use the Euler-MacLaurin formula to compute the difference between the sum and 
integral occurring in the above bracket: 


R 1 
$F(0) + F(1)+ FQ2)+°°°- | dn F(n) = — 5 BaF"(0) — Bk") +: 
10) 
The Bernoulli numbers B, are defined through the series 
a ne ae 
—1- 2, 25 
and B, = 4, By = —,.... We have 


F(n) = ly du Jis(4) F'(n) = -2n's(@) 
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We assume that f(0) = 1, while all its derivatives vanish at the origin, so that 
| F'(0) = 0, F’’(0) = —4, and higher derivatives of F are equal to zero. All reference 
to the cutoff has therefore disappeared from the final result 


-» POREB, a Ec 
wae” 708 
The force per unit area F reads 
n* he 0.013 
F = —-~——{ = ‘ - 
740 a tae dyn/cm (3-150) 


and its sign corresponds to attraction. ; 

This very tiny force has been demonstrated experimentally by Sparnay (1958), 
who was able to observe both its magnitude and dependence on the interplate 
distance! 

The above derivation may be criticized on account of the fact that we have 
seemingly disregarded the effects outside the plates. In the present case they turn 
out to cancel exactly. The lesson is that vacuum fluctuations manifest themselves 
under very different circumstances than those encountered in particle creation or 
absorption. By considering various types of bodies influencing the vacuum con- 
figuration we may give an interesting interpretation of the forces acting on them. 
This is one line of thought that may be kept in mind and in which we may 
recognize some of the origins of Schwinger’s source theory approach to quantum 
field phenomena. 


As another example let us give a brief discussion of Van der Waals forces among neutral atoms 
or molecules. It can be presented by studying the field fluctuations in the presence of the two systems 
very much as above. After all, we should be able to give a microscopic description of the Casimir 
effect among macroscopic bodies using the residual forces among constituents. Rather than actually 
doing that we shall follow a presentation due to Feinberg and Sucher. A more detailed treatment 
is given in Chap. 7. The classical energy density of the electromagnetic field is (E? + B?)/2, as we 
recall from Chap. !. A neutral system may interact with an electromagnetic field. Let ae(ag) be the 
static electric (magnetic) susceptibility. In a static electric field if p denotes the charge density, 
J d?x p(x) = 0 because of neutrality. The polarization is defined as P = J d?x p(x)x and the corre- 
sponding contribution to the energy due to a change in the clectric field 6E = —VO6V is 
§ dx pdV = — f d3x p(x)x dE = —P- 6E if dE is almost constant over the extent of the system. The 
susceptibility is such that P = agE for weak fields; hence under a small change 6E, the variation 
of the interaction energy is —a,E-:6E = 6[—a,(E*/2)]. A similar reasoning yields the magnetic 
contribution, so that for the interaction energy we have 

E. 


Int 


=> —4(a,E? + a pB7) 


We attempt a relativistic phenomenological quantum generalization of this result. We want to write 
an interaction part of the lagrangian such that when integrated over three-space it reproduces in a 
static field and in the nonrelativistic limit, the negative of the above expression. The neutral system 
may be described by an hermitian scalar field g, and the only Lorentz scalar quadratic in F,, and in 
y having the required property is 


— Ling = 910.90, 9F °F", + 9297 F* 
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with g; and gp related to w, and «, as follows: 


1 m 
= — = ——w 
St (a+) G2 ri 


These relations and also the desire of having Y,,, quadratic in (instead of linear, for instance) 
stem from the fact that, in order to make the comparison with the nonrelativistic limit, we have to 
take averages in a one-particle state. In this nonrelativistic limit ¢1| fd? 2enp2(eay| Le Loe Lil 
and <1] f d?x :(Ao@)?: |1> = m<1| 1. Since — Fin = inn» the above expression enables us to obtain 
an asymptotic form of the static potential between two neutral polarizable systems. The reasoning 
is patterned after a similar derivation of the Coulomb potential between charged systems which goes 
as follows. The interaction energy of charge 1 with the field is written § dex jx) A(x) but A(x) 
is generated by charge 2 as A(x) = J d*y G(x — y)j'*y). Since j') is assumed to be time independent 
we may first integrate over y° using 


1 1 
Cia SSS 
Ax — y) 4in? x2 — x? — ie 


This leads to 


1 1 
A= | dy — ayy 
4n|x—y| 


and hence to the usual Coulomb interaction 
Fy) 


1 
B= lax oy Oa 
Int fexay, ray | 


Let us return to our case. We are obviously interested in Green functions of quadratic operators 
such as F? (occurring as the coefficient of g2), for instance. If G,,.49(x — y) stands for 
<0] TF ,o(x)Fap(y)|0>, a Green function for F? will be proportional to the square of the above 
expression, that is, Gyo,09(x — y)G°"**(x — y). This distribution is typically of the form 1/(x* — ie)*. 
The contribution arising from g, has a similar structure. Consequently, we expect a potential energy 
proportional to 


1 constant 
[(xo — yo)? —(x—y)? —ie]* |x-y|’ 


This result originally derived by Casimir and Polder is at variance with the nonrelativistic treatment 
of Van der Waals forces, the outcome of which is a potential with a decrease of |x — y|~° at a large 
distance. We shall derive in Chap. 7 the full expression given by Feinberg and Sucher for the potential 
at large separations: 


vix—y)~ | do 


1 
Vase ae PEE oy [23(abok + afso'e) — T(akat + abor8)] 


and comment on its domain of validity, thereby explaining the discrepancy with the |x — y|~° non- 
relativistic law. 


3-3 DIRAC FIELD AND EXCLUSION PRINCIPLE 


In the preceding chapter we have already introduced Dirac particles and presented 
the arguments requiring such a theory to be described as a many-body theory. 
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Canonical quantization has led up to now to an interpretation in terms of Bose- 
Einstein particles, but experiment shows undoubtedly that spin 4 particles such 
_ as electrons and nucleons satisfy Pauli’s exclusion principle and obey Fermi-Dirac 
statistics. We have thus to modify the rules of the game when dealing with half- 
integer spin particles. 


3-3-1 Anticommutators 
We have to consider the Dirac equation on a similar footing to Maxwell’s 
equations. Both are classical field equations. We have therefore to introduce a 
lagrangian and conjugate momenta for the complex fields w and wt (or W = Wty”). 
In the free case the equations 
—: = — 

(iy; -mw=0 liy-d +m =0 (3-151) 

follow from a variation of an action using as density the first-order lagrangian 


& = 5 [D)"(0.W) — 6D y"V) — mb (3-152) 


We have, for instance, 


Of A 
ew aa.) 


and similarly when varying with respect to w. When w and yw satisfy the field 
equations the lagrangian (3-152) vanishes. Since canonical quantization does not 
work, we shall rather use the following procedure. Assume wW and w to be quantized 
operators acting in some Hilbert space. Let us inquire under which conditions the 
Poincaré group generators, built according to Noether’s theorem, transform 
the fields as required by the transformation laws studied in Chap. 2. Since these 
generators, as well as all observable quantities, are expressed in terms of the 
dynamical variables wy and w, this will ensure the covariance of the theory and 
we will then verify that these observables do commute at space-like separations. 

The energy momentum tensor density is obtained from (3-152), using the 
equations of motion as 


| 
= 5 7 8.U — mb +5 d,r"W) = 0 


Pe op ee oy — gue = | pd 3-153 
a’ gece 8 YE 
For lack of any prescription we have arbitrarily ordered this expression by setting 
W to the left of y. It is clear that we proceed heuristically. The tensor ©*” does 
not seem symmetric either. Let us therefore study the generalized angular 
momentum density taking correctly into account internal variables (i.e., spinor 
indices). Consider an infinitesimal homogeneous Lorentz transformation under 
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which we must have 


W(x) > W(x) = S(A)W(A™*x) = W(x) + 6W(x) 


i [ — 5 5i,gM% = M*%®,, = —MP*,, = i(g*sghy — 9°?) 
S(A) = I- ; SDa po? (3-154) 


i Co) ee ens : 
w(x) = — 5 ODap a + ilxte — x?d0*) I(x) 


— 5 Sedap L(x) 


The operator I7? = 40% + i(x7d’ — x*0*) acts on the spinor indices as well as the 
configuration variables. Correspondingly, 


Syp(x) = dyp(x)*y® 5 Wx\E*6 ep 


a@B 


with [28 = yO fr ty? — = — i(x* 98 — x? 92) 


If dw** is x independent we verify that the action is invariant. Thus when dw 
varies with x we find as a variation around the stationary point 


6l = | ats] hn & oh+ iho vn 2) yt \v ae 
- ~ [a (Ce a =0 


A candidate for the generalized angular momentum tensor density is therefore 


EB: o ot 
Jiak = suet me Oey i tly 


. (3-155) 

= x7@“F — xf Qe ee sy i oly 
We can verify directly that 0,J"“* = 0. This tensor has an orbital and a spin 
part. The above construction gives a similar role to w and w. The space integrals 
of © and J°* will yield the quantum generators of the group when we find 
a commutation prescription for the fields. We can also add that the invariance 
under phase transformations wy > e'“, we W leads to a conserved current 


Jt = Wy (3-156) 
which was used in the one-particle theory of Chap. 2 to normalize the states 
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when integrating J°. It will be necessary to give a more general interpretation 
in the quantized picture. 

j To implement our program let us now expand the operators w(x) and w(x) 

in terms of the c-number plane wave solutions of the Dirac equation, with 

operator-valued amplitudes b, b’, d, and dt: 


ak m ae 8 
W(x) = Qn a yy [b.(k)u(k) ee * + d}(k)v(k) e* | 
0 a=1,2 


(3-157) 


: dk | | 
W(x) = lap = a [ bi(k)ia(k) elk aan d,(k)5(k) ew ik =] 


The spinors u and v were given in the previous chapter, Eqs. (2-37) and (2-38). 
The operators b and d must satisfy commutation rules such that 


w(x + a) = eF (x) eP “4 
or in differential form 
OW) =i[P. WOO] 6,0) = [Pp WO] (3-158) 
We express P,, using the decomposition (3-157) with the result that 
ie fex ©°,= |S oo k, > [bi(k)b.{k) — da(k)dl(k)] (3-159) 


from which we have to subtract the vacuum contribution. We read from (3-159) 
that if the vacuum is defined in such a way that b,(k)|0> = d,(k)|0> = 0, and if 
we were to quantize according to commutators, b particles and d particles would 
contribute with opposite signs to the energy. The theory would not admit a stable 
ground state. 

From (3-158) translational invariance is satisfied provided 


Ber b.Ak)| = — Kk, DAk) [Ais d.(k)| = —k,d,(k) 
[ PiyDlky) = kei (k) [Po ditk)| = k,di(k) 


Replacing P,, by its expansion, this is equivalent to 


(3-160) 


k® 
b [bj(q)be(a) — dp(q)d}(Q)], bak = — (2m)? — 5%(k — q)balk) 


and three analogous relations. If we assume that 
[d}(q)de(q), ba(k)] = 0 


this edition can be written 
oe 
Y, [b4(a){bo(a), balK)} — {b5(q), balk} bel a)] = — 2)? | 5°CK — a)balk) 
B 


Therefore, we can insure the correct interpretation of energy and momentum 
using, as an alternative, anticommutators (the above curly brackets) instead of 
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commutators between the fundamental creation and annihilation operators b’, d’, 
b, and d. This solves at once the stability question mentioned above since, after 
vacuum energy subtraction, we are left with only positive contributions to the 
energy. We set, therefore, 


0 
{bull Dig} = 2m)? — 6k — a) dp 


‘ (3-161) 
k 
{d,(k), dj(q)} = (22)? a 6°(k — q)dap 
and all other anticommutators vanish. As a consequence, 
{Welt, x), WH(t, y)} = 6°(K — ye, (3-162) 


We may then easily verify that conditions (3-160) are fulfilled, together with the analogous ones 
stemming from the requirement of covariance under homogeneous transformations and involving 
the commutators of the generators J = f d?x J°’ with the field. The interpretation of (3-160) is 
that b‘(k) and d'(k) create [b(k) and d(k) destroy] a four-momentum k*. 


Wick products can be>generalized to Fermi fields. When reordering the 
creation operators to the left of the annihilation ones, a sign corresponding to the 
parity of the permutation must be introduced. The correct definition of total energy 
momentum is therefore 


‘ d*k m - ; 
pHe= (a po k ae : bi (k)b.(k) — d.Ak)di(k): 

a4 (3-163) 
= | ke YS [bi(k)ba(k) + di(k)d.(k)] 


iT aes 


leading to a sum of positive contributions to the energy of a quantum state. 
Let us elaborate the structure of the corresponding Fock space. 


3-3-2 Fock Space for Fermions 


We focus first on one-particle states. The necessary smearing in momentum space 
is implicitly understood. For a given four-momentum we observe that we now 
have a fourfold degeneracy. Denote the corresponding states la> (a= 1,2)345 


[I>=bi(Q)0> = f2>= bK)JO> = 3 =dI(K)JO> ~—|4> = ak() Jo 


They satisfy P,,|a> = k, |a>. We look for observables commuting with P, in order 
to distinguish these states. One of these observables is the charge (normalized 
to +1 for a one-particle state) 


Q= [ex 7) = ex W"(x)W(x): 


: (3-164) 
Z | ak YS [b4Kbalk) — dha) 
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When dealing with Dirac particles the notation dk will mean [a°k/(22)*](m/k°) 
instead of d°k/(27)°2ko as it is for bosons. Since the vacuum has zero charge and 


LQ, bi(k)] = bi(k) = (0, di(k)] = — i(k) 


we have 


_f+la a=1,2 
olay={* i a= 3.4 


In contradistinction to the “classical” case where we attempted to interpret (3-164) 
aS a positive square norm it is seen that in the quantum case this is not so. The 
use of anticommutators has totally reversed the situation; energy is now positive 
and charge an indefinite quantity. 

Dirac’s quantized theory therefore describes particles of two types. In electro- 
dynamics eQ will be the electric charge and yw will annihilate electrons and create 
positrons. Since [Q, P,,| = 0, Q is time independent, and [Q, w(x)| = —W(x) and 
[O, V(x] = WO. 

To characterize the one-particle states completely it remains to introduce 
spin. In analogy with (3-158) we have for Lorentz transformations 


py 
LJ", W(x)] = — icv — xd") + = Woo (3-165) 
Since 
b,(k) = | d3x @™(k) e*** y W(x) 
d}(k) = [ex B(k) eg *** yp(x) 
we find 
= } <— <_— ot” 0. (a) eras 
[J*, bE(k)] = | ax W(x) qe Gl Xx aeaya- a (k)e 
(3-166) 
= = HY 
[y, di(k)] = ~ [ax Bk) eB [ice a — x" 6") + Joos 
Consider now the action of the Pauli-Lubanski operator W, = —4€suvpJ""P? on a 


state |a>. The operator P? is replaced by its eigenvalue k’. Let t = (1,0,0,0) be 
the time axis and 


a normalized space-like four-vector orthogonal to k in the two-plane (t, k). Its space 
component is obviously along k. Choose the third axis along k. Then (W: n/m) |a> 
reduces to 


W-n ae |a> 
m = ‘ 
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When acting on the state |a> it is clear that the orbital contribution to J 12 
disappears, and since J‘? |0> = 0 we are left with 
W:- 
m 


 bi(k)|0> = a S ut(k) = u(k)b}(k) [0 
B 


To proceed we have to make a choice for the spinor basis u'”, v. We set 


u!(k) — ee ie 
./2m(k° + m) \ 0 


ee 
with Miler = \_ go” A) 
The corresponding states are helicity states. Since the third axis was chosen along 
k we find 
m o cya 
0 ut(k) uMk)=> OF a =le=—! 


with the result that 


W-n tla a=1 
LAMM ei ae (3-167) 
Similarly, 
W:- 
— dt(k)|0> = — > otto) 2 v(k)dh(k) JO 
Defining 
pk) = ee ( 0 
2m(k® + m) \x 
*k o@ a=1 
with leg = x 
[ky * {0 a= 2 
we obtain 
so Ok) SoM) =O ey = 1 op = 1 
W-n —tla az=3 
d = e 
an r |a> ae ae (3-168) 


Formulas (3-167) and (3-168) complete the characterization of states: |1> and 
|4> have helicity 3, |2> and |3> minus $. 
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We observe the typical inversion between the spinors u and v. It is the spinor v with spin o3 = —1 
which corresponds to positive helicity, in agreement with the hole interpretation. 


Consider now multiparticle states. Denote the creation operators by the 
collective symbol a‘, omitting therefore all indices of momentum, charge, or spin. 
A basis of Fock space is generated by the states 


at(1) 50K a‘(n) |O> 
From the anticommutation properties of the creation operators these states will 
be antisymmetric in the wave function arguments 1,...,n and, in particular, will 
vanish if two of those coincide. We recover in this formalism Pauli’s exclusion 
principle, valid for electrons and more generally for all identical fermions. 


Quantization using anticommutators has naturally generated Fermi-Dirac 
statistics. 


The reader might enjoy discussing the free relativistic electron gas at thermal equilibrium. What 
happens if only the total charge is conserved and not electron and positron numbers separately? 


3-3-3 Relation between Spin and Statistics—Propagator 


From the relations (3-161) follows the anticommutator of two free fields at arbitrary 
separations. Clearly w(x) and W(x’) anticommute; similarly for W(x), W(x), while 


Wel, Bel) = | dk fe“ uw (onaie (le) + 2 —” oPUKDELK)] (3-169) 


where & and &’ are Dirac indices. From (2-40) and (2-41) we have 


uPAR) = (= " Y oP(KoP(k) = (") 


It follows that 
{We(x), We(x’)} = (id, + m)zviA(x — x’) (3-170) 


where iA(x — y) is the expression encountered in (3-77) and (3-56) in the analogous 
case for Bose fields. We conclude from (3-170) that when (x — y) is a space-like 
interval the anticommutator vanishes. Moreover, if xy? = t, 


{ Welt, x), Welt, y)} = —V¥e GoA(x? — y®, x — y)|x0=y0 = yee 5°(x —y) (3-171) 


in agreement with (3-162). In the sense of anticommutation ip! appears as conjugate 
to w, as suggested by the lagrangian (3-152), even though we did not follow the 
procedure of canonical quantization. . aa 
We could wonder what would have followed from such canonical quantization. 
Spin 4 particles would appear as bosons and energy would not be bounded from 
below. Furthermore, the commutator between two fields would be given by an 
expression similar to (3-169) with a relative minus sign between the two terms on 
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the right-hand side. As a consequence, the result would read 
: d°k See) ik -(x—-y) 
(id, + m) | ase ore ) 


This integral is an even solution of the homogeneous Klein-Gordon equation 


: | ee 
aite— d= [spare tk-(x—y) 4 gik-(—y)) (3-172) 


which does not vanish for (x — y)* < 0. 


Study the large separation behavior of A,(x — y) in the space-like region. 


In other words, local observables would not commute at equal times and 
locality would be violated. Similar violations would occur in a scalar theory 
quantized according to Fermi statistics. 

These properties are instances of the relation between spin and statistics 
following from a local relativistic quantum theory. Half-integer spin fields have 
to be quantized according to Fermi-Dirac statistics, 1.e., with anticommutators, 
and integer spin fields according to Bose-Einstein statistics, i.e., with commutators. 
A more general proof can be given in an axiomatic framework. It is to some 
extent fascinating to note that such a deep connection, essential to the stability of 
matter in circumstances apparently very remote from the relativistic domain, does 
require these concepts. We do not know any alternative basis for it. 

The time-ordered product of two Dirac fields defined as 


TWAx)Pely) = O(x° — yWaxbely) — (v9 — We Q)WAx) (3-173) 
is such that 


THAbe(y) = Ol THO e()0> + WAX Pely): 


<O| TWAx)Wely) [0D = iS(x — yee (3-174) 
we dk —ik+(x-y k+m 
see 9= | aaee Oe ae 


which agrees with (2-113). We note that 
d* k e- ik>x 


S(x) = —(i9 + m)Gp(x) = (id + m) | aos are (3-175) 


Since G(x) is even in x it follows that 


S*(—x) = —(—i9? + m)GAx) 


With the help of the matrix C, introduced in (2-97), such that Cy#™C~1 = —y*, 
we have 


CS\(=xC-? = S(i) (3-176) 


a 
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This symmetry property generalizes the one of G,(x). In the next subsection we 


_ Shall investigate its meaning. 


3-4 DISCRETE SYMMETRIES 


Symmetries are, paradoxically, best understood when they are broken by external 
agents such as a measuring apparatus. Here we review briefly some discrete 
symmetries arising in the context of noninteracting quantum fields. Kinematic 
invariances have been discussed before and the subject of internal symmetries 
will be reconsidered in Chap. 11. 

From the active point of view we understand by symmetry a one-to-one 
correspondence between states or, rather, density matrices of pure states 
|p><o| > |" ><" | preserving transition probabilities 


<p] eM BFP? = [Cy] em HEP 14,9]? 


This definition is somehow restrictive since it singles out the time variable, so 
that pure Lorentz transformations or time reversal have to be treated in- 
dependently. This will be done below for the latter invariance. From Wigner’s 
theorem, any one-to-one mapping of pure-state density matrices p — p’, with the 
property that trace (p,p2) = trace(p,p2) can be extended to a unitary or antiunitary 
operator in Hilbert space. 


Starting from an implementation of the symmetry on density matrices, the phases of the states can 
be adjusted in such a way as to produce a linear (or antilinear) operator on the Hilbert space. A 
symmetry group will be generally realized through a projective representation with U(g1)U(g2) = 
2? (91-9) U(g1, g2) where w(g1, g2) is a phase. 


In the above restricted sense, every unitary operator U commuting with the 
hamiltonian generates a symmetry. While we shall mostly deal here with the 
Dirac field, the reader is invited to reconsider the cases of scalar or vector Bose 
fields. 


3-4-1 Parity 


We assume that it is possible, with appropriate modification of the experimental 
apparatus, to obtain the parity transformed state. From the field point of view we 
look for a unitary operator F satisfying 


PU(x)P* = npyW(X) — -X* =X, |np| = 1 (3-177) 


since from the previous chapter we know that y°y/(X) satisfies the parity transformed 
Dirac equation. We thus expect 7 to commute with H. Equation (3-177) entails 


PbAp)P* = nyb.(b) youl(p) = u(p) 


(3-178) 
Pa (pP' = —n3dlp) — y°v6) = —°) 
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Irrespective of the phase ,, the relative parity of a fermion-antifermion system 
is minus one and ¥% is a multiple of the identity. A unitary operator fulfilling 
the above requirements is 


~ ome Tl a ‘ 
FP =expi [a » fos ante + 5 bl — ath + n)d,(k) + 5 ish 
a=1,2 
(3-179) 
where J is arbitrary and n, = e~“***). In particular, n»=1 if 4= —n/2. A 
simple means of verifying that is indeed a symmetry is to observe that under 
a time translation 


eiHa b,(k) e ia a e7 tka b,(k) eiHa d,(k) e ita = e7 ik°a d,(k) 
Hence, 
eiHa PD e~ ita = P 


We may further verify that AP“P*t = P,. In a coupled theory the complete parity operator will be 
the product of those pertaining to the different fields. Construct in particular 7, for the electromagnetic 
field such that : 


PyAA"(X)P) = A,(X) (3-180) 
Show that the bilinear Dirac current has a similar behavior: 
Py: V(x)y"h(x): Pl =: WOy W(X): (3-181) 


from which follows, using d*x = d*x, that the interaction lagrangian 
\ atx :(x)y"(x): A,(x) 


is parity invariant. 


3-4-2 Charge Conjugation 


We have seen that quantized fields may describe particles of opposite charge with 
identical masses and spin. The corresponding charge may have a different inter- 
pretation according to the physical context. It can be electric, baryonic, leptonic, 
etc., whichever the case may be. Charge conjugation invariance therefore implies 


1. The existence of antiparticles 
2. The symmetric behavior of both kinds of quanta 


We had a first example with the charged scalar field. It may, however, occur 
that particles and antiparticles are identical. Such is the case for photons, where 
the corresponding operator @ just reverses the sign of the field 


CAA(x)6t = —A,(x) (3-182) 


for reasons to become clear below. 
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From Chap. 2 the corresponding action on a Dirac field is 
CW (x)6! = ne CW" (x) (3-183) 
where transposition refers only to Dirac indices. In the standard representation of 
y matrices 
c = iy?y? 
To be definite, we choose creation and annihilation operators for the helicity 
states b(k, +), d(k, +). Then 


k 
ee ee ("5 } (3-184a) 
2m(k° + m)\ 9 
where o° kok) = a 0+(k) p(k) = One 
and by definition 
E —Kk+m ( 0 
kot) = Ca? (k, +) = ——_—_—— - (3-184b) 
ae ./2m(m + k°) \x+(k) 
In the usual representation y+(k) = — io, i(k), and we can verify that 
o-kys(k) = +y+(k). Furthermore, 
Co7(k, +) = u(k, +) (3-185) 


With these choices we readily derive from (3-183) that 
b(k, +)" = ned(k, +) 
€d'(k, eye = neb'(k, ae) 


This could have been imposed at first, with (3-183) following as a consequence. 
Up to a phase factor, @ interchanges particles and antiparticles with the same 
momentum, energy, and helicity. The vacuum is left invariant. An explicit 
expression for @ is 


Cy = 6 :b. (3-187) 


(3-186) 


€, =exp —i {at Y Afbt(k, e)b(k, €) — dt(k, e)d(k, €)] 
_ (3-188) 
Tt 
2 
with 7, = e'*. The only effect of @, is to carry this phase, and y. = 1 corresponds 
to @, = 1. The reader will also check that the current : yy, : is odd under charge 
conjugation. 


€, =expi {a YS. [bt(k, e) — dt(k, e)] [b(k &) — alk, 2] 
e= +1 


As an application let us classify, according to charge conjugation, the lowest bound states of a 
fermion-antifermion system, the prototype of which is positronium. iid 

The latter is an (e*e ) system analogous to the hydrogen atom (pe ) with the proton en 
by a positron. A nonrelativistic description is justified as a first approximation, due to the weakness 
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of electromagnetic binding forces. The ground state is an s wave, n = 1, but hyperfine effects split a 
triplet orthopositronium state (°5j, if we use the notation 75*!L,) from a singlet (So) parapositronium 
state. Simplified wave functions correct from the quantum number point of view are written, using a 
fixed axis for spin quantization, as 


|J = 1, M =0, ortho) = [ea ei(|q|)[bt (qd (—q) + b1.(q)d* (—4)]} |0> 


|J = 0, M =0, para> = [ea o(|q|) [bt (q)dt. (—q) — bt (q)d" (—q)] 0> 


The relative momentum wave functions gy, and @o only depend on the magnitude of q; 
b4.(q) [or d‘.(q)] denotes an electron (positron)-creation operator of momentum q with spin +4 
along a fixed axis. Charge conjugation reads 


€b',(q)@t = ntdi.(q) — €d',(q)€" = n-b4 (a) 
The arbitrary phase 7, disappears when @ acts on these states with the result that 


€ |ortho> = —|ortho> 
(3-189) 
@ |para> = |para> 


The signs arise as follows. Charge conjugation interchanges electron and positron, as a result of which 
the relative momentum changes sign, leading to a factor of (— 1)" = 1 for s waves; the spin indices 
are interchanged, leading to a plus (minus) sign for a triplet (singlet) state. Finally, there is an additional 
minus sign arising from the anticommutation of b' and d' operators. This is an indirect and unexpected 
manifestation of Fermi-Dirac statistics. 

The positronium states are unstable and have a slow decay by photon emission. From (3-182) 
the electromagnetic potential is odd under @. This is, in fact, a condition for electromagnetic inter- 
actions to be invariant under @. Hence an n-photon state behaves as 


@ |n> =(-1)"|n> 


Correspondingly, orthopositronium must decay into an odd number of photons, and parapositronium 
into an even number. One photon decay is forbiaden for the ortho state by energy momentum 
conservation. It must decay into at least three photons, while parapositronium can decay into two 
photons and has therefore a much shorter lifetime. The coupling constant being the fine structure 
constant «, for the lifetime t we expect the ratio tingtet/Tiripler ~ (a). We shall compute these quantities 
in Chap. 5. To lowest order in a, 


Is =Tt = 557 = 0.53 x 10 : eV t= 1.25 x 1Omey s 
3-190 
-1 ee 6 - 
ie — 2 mo°=4.75x10°-°eV = t, = 1410 x 10775 


A numerical accident makes t, smaller by an order of magnitude than at,. The hyperfine splitting 
will be discussed in Chap. 10. 


3-4-3 Time Reversal 


Classically the meaning of time-reversal invariance is clear. By reversing the 
velocities in what used to be the final configuration, a system retraces its way back 
to some original configuration, if the fundamental dynamics enjoys this invariance. 
Hence initial and final states are interchanged with identical positions and opposite 
velocities. This interchange has the consequence that in quantum mechanics the 
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corresponding operator 7 is antiunitary, that is, 
Jel|FW=Wle> FtFT=1 
The requirement of invariance under 7 means that 
FHI =H 


if H is the hamiltonian. We assume time-translation invariance throughout. 
Consequently, 


TF e~iWtz-ty) TF* = e7 tHt-t) 


The transition amplitude from the state |g;> at time t; to the state |p,> at time 
ty is equal to the corresponding amplitude from state |7@y,> at time ft; to 
|7 y;> at time ty. Indeed, 


Kose? loi) = Fos (Fe F")| Toi" 
= (F p,| eM) |Fy,)* (3-191) 
= <F oe“ |Fo,> 
Since momenta, but not positions, are reversed, orbital angular momenta change 
sign under 7, and so must spins. In particular, helicities are unchanged. 
For a scalar relativistic quantum field @ satisfying the Klein-Gordon equation, 
TF ot, x)7* equals w(—t,x), possibly up to a sign (since @ is hermitian). This 
leads to : 
TF a{k)7* = na(k) 


eri (3-192) 
T ot, x)7* = no(—t, x) 


For the electromagnetic potential the corresponding transformation is 
TA"(x)T* = A,(—X) (3-193) 


Consider finally a spinor field, and let us choose the helicity basis. The require- 
ment is . 
T b'(k, e)7* = n¥bt(k, e) ef se) 


TFT a'(k, )\F* = nrdt(k, 2) e~ ele 


with 17 a fixed phase. We want to choose 6,(k, ¢) and 6,(k, ¢) in such a way that 
TW(x)F* satisfies the time-reversed Dirac equation. From the antilinear character 
of 7 we obtain 


(3-194) 


FWOT* = nr | a Y [b(K, e)ur(ke, ee! clk 4 dt(K, e)v*(k, ee HM eH] 
The spinors u and v are defined in (3-184). We change k into k in the integral. 
If there exists a fixed matrix A such that 

Au(k, @) = e- 9&2) y*(k, e) 


i (3-195) 
Av(k, 8) = e— he) y*(K, e) 
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then we shall have 
TWt,x)T" = nr AW(—t,x) (3-196) 
From the fact that y° is hermitian and y antihermitian, it follows that 
(KT — m)u*(k,e)=0 (KT + m)v*(k, 2) = 0 
Multiplication by y°C yields 
(kK — m)ySCu*(k 2)=0 (K+ m)y>Cv*(k, 2) = 0 


Consequently, we expect that A~! is equal to y°C up to a phase provided that 
the two component spinors g+(k) and g+(—k) are correctly related. Indeed, in 
the standard representation of y matrices 


y>Cu*(k, e)= ge - a 


./2m(k® + m) 0 
— /tm(ko + m)\ 0 ~io2 0 


Therefore all that is required is that —io,p*(—k) be equal to ok) up to a 
phase. This is the case since —io2 is unitary and 


(6° k)[—io.p*(—k)] = —io2[-(6- k)*%(—f)] = e[—-iorp*(—k)] 
Hence 
—io2.9%(—h) = e% —(k) 
and y>Cu*(k, 2) = ef) uk, ) 


Now (y°C)~! = —y°C; thus, up to a factor which can be absorbed in nz we have 
indeed found the matrix A required in Eq. (3-195), at least for the u spinors. A 
similar calculation holds for v. The standard choice of phase is 


A= iyly? = —iPC (3-197) 
and Eq. (3-196) is finally established. : 


3-4-4 Summary 


For the Dirac field we can now summarize the various transformation properties 
of quadratic forms under discrete symmetries. Define them according to their 
tensor character : 


S(x) = :W(x)W(x): 
V(x) =: W(x)y"h(x): 
T(x) = :(x)o""(x): (3-198) 
A*(x) =: W(x)p>y"h(x): 
P(x) = i: W(x) W(x): 
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The last factor i is chosen for hermiticity. The symbol A for the pseudovector 
(or “axial” vector) current (see below) is conventional and should, of course, not 
be confused with the vector potential. Using the results of the previous subsections 
and keeping in mind that the field anticommute and that 7 is antiunitary, we 
' have the following table: 


S(x) V(x) TY) A*(x) P(x) 
= =| 
PF | S(x) V(X) ElX) —A,(X) — P(x) 
aS (x) — V"(x) — T"(x) A*(x) P(x) (3-199) 
F\S(-%) = V—=) -To(-3) A,(-2) — P(-9) 
© | S(—x) —V*(—x) T*’(—x) —A*(—x) P(—x) 


In this table each entry represents the action of the operator on the corresponding 
density. For instance, 7 T**(x)7* = —T,,(— x). We have added a line for the 
combined © = Y¢7 antiunitary operation. The corresponding transformation 
laws of the electromagnetic vector potential A“, (x) are 


PAL (Pt = AL, (%) CAH (x)6t = — AL (x) 


F- Gt . t ay 
TA (XT! = 2 Gee CAO! = —Al (=x) 


As a result the combination 


W(x}yyW(x): AZ (x) 


behaves as a scalar density S(x). Finally, the behavior of A“(x) and P(x) under 
parity justifies the denominations pseudovector current and pseudoscalar density 
respectively. All phase arbitrariness has disappeared from table (3-199). 

In this section we have discussed the most frequent discrete symmetries and 
constructed the corresponding unitary (or antiunitary) operators in terms of the 
fundamental free fields. The necessity of such a construction is clear, for only 
then can we be sure that the transformed states do exist. It is a different matter 
to inquire whether the dynamics is left invariant when interactions are taken into 
account. This requires us to examine whether UHU' = H. When we consider such 
interacting fields, we shall try to define the symmetry operators acting on the 
states and the fields according to the above rules. The question will then be: 
Is the invariance implemented at the dynamical level? 

A fundamental property of local quantum theory first discussed by Pauli, 
Zumino, and Schwinger states that, in any case, © remains an invariance of the 
theory. This is the famous PCT theorem. We sketch here a proof restricted to 
lagrangian field theory, referring the rigorous-minded reader to the works quoted 
in the notes for a more general treatment. Let a local quantum field theory be 
described by an action principle involving an hermitian Lorentz-invariant lagran- 
gian. This is a combination of local scalar densities expressed in terms of the 
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basic fields, eventually normal ordered. The quantization respects the spin statistics 
relation. The PCT theorem then states that even if P, C, and T are not separately 
invariances, their combination is. This implies, in particular, the existence of anti- 
particles for charged fields (with masses and spins equal to those of the corre- 
sponding particles) and the particle-antiparticle identification for neutral fields. 
In summary we want to show that the lagrangian (x) behaves under @ as a 
neutral scalar field 


OYF(x)Ot = L(—x) (3-201) 
We should have added that for a scalar field in general 
Oo(x)Ot = o'(—x) (3-202) 


If (3-201) is satisfied the action will remain invariant. We shall be satisfied here 
with a lagrangian depending on scalar (@), vector (A,,), and spinor (yw) fields possibly 
carrying internal indices. The theorem remains true for higher spin fields. It is 
clear that Y can be constructed using 4,, W, W and only hermitian scalar fields 
combining if necessary the charged ones into 9; =(y+ 9')/,/2 and 2 = 


(p— pVi,/2 (Og;(x)O' = oe —x)). We can then summarize the action of © as 
follows: 


— 


It changes the arguments.of fields from x to —x. 

2. As a consequence derivatives change sign, 0, + —0,. 

3. A vector field A*(x) gets an extra minus sign, 1.e., behaves as the gradient of a 
scalar field. 

4, Any quadratic form in (w, W) gets a sign (—1)’ where P denotes the number 
of Lorentz indices carried by y matrices or derivatives. This follows from table 
(3-199) and rule 2. 

5. Finally, constants are complex conjugated. 


Since ¥ is a scalar, each term in a monomial expansion is obtained by contracting 
an even number of Lorentz indices. The minus signs coming from vector fields, 
derivatives, or quadratic forms in (w, W) are cancelled. The net effect is thus to 
change Y(x) into Y'(—x) = ¥(—x), since the order of operators is irrelevant 
under normal ordering as vector and scalar fields commute and the anti- 
commutation of the y fields has already been taken into account in (3-199). As 
a consequence, if the vacuum is invariant under ©, so will be the dynamics. 


It is interesting to check that equal-time commutation rules are invariant under ©. This involves 
some subtlety since interactions may modify the expression of conjugate variables. Such is the case, 
for instance, for electromagnetic interactions of charged bosons and more generally when the interaction 
part of # involves derivatives. We may also verify that the hamiltonian density behaves as 


O¥(0, x)@' = ¥(0, —x) 


and thus @HO't=H 
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The above proof ignores the difficulties associated with the construction of an 
interacting field theory. The latter has to give a precise meaning to otherwise 
undefined products of operators. Nevertheless, the algebraic properties implied 

in the PCT theorem are preserved by this operation. 


As an application we discuss form factors, a relativistic generalization of charge distributions. 
Consider first the matrix elements of the current V“(x) of a free Dirac theory between two one- 
particle states 


<p’, B| V*(x)|p,a> = e -P9* Cp’, B| V0) |p, o> 
The greek indices «, B,...,6 denote polarizations, |p, a> = bi(p)|0>, and 


A) = ails) {fa 1 dkz Ya" (k, pul (k)bY les )Balka) + 
yd 
We have not explicitly given the terms with vanishing contribution to the matrix element. From 
. Ya 
eng MIE Beni 
it follows that 
<p’, B| V(0) |p, «> = a (py4u(p) 


This is in agreement with the definition of the charge 


<p’, B| O |p, «> = [ex <p’, B| V°(0, x) |p, «> 


S [es eilP’—P)-x f(p'yy°u™(p) 


= (2n)°5°(p — p)a(p')y°u(p) = <p’, B| p, 2 


Recall that iis(p’)°u.(p) is normalized to 54(p°/m). 

Assume now that j“(x) is an hermitian four-vector current density with the same transformation 
properties under discrete symmetries as V(x), and let us write the general structure of its one-particle 
matrix element 


<p’, Bl i*(x) |p,x> = "PP * (p'})O%(D, p)u(p) 


In the sequel we always understand that the numerical matrix O*(p’, p) is sandwiched between the 
two spinors i and u. From Lorentz covariance O*(p', p) must obey 


S(A)O*(p', p)S~ *(A) = (A~*)*,0"(Ap’, Ap) 
and y°O*(p’, p)'y° = O*(p’, p) 
from hermiticity. We can take into account the fact that the spinors satisfy the free Dirac equation 
by replacing any O“(p’, p) by 
pom 


ptm 
O*(p’, p) ——_ 
aS (p’, p) a 


O*(p', p) > 
In particular this entails the Gordon identity 


1 oe 
y* + — [(p' + py + io*(p' — p)v] 
2m 


160 QUANTUM FIELD THEORY 


We also require the current j*(x) to be conserved 


0 = <p’, Bl 6,,j7*(x) |p, o> = i(p’ — p),<p’, Bl j*(x) |p, &? 


4 . B U * a =, 2 
Thus (p’ - p),O“(p’, p) vanishes. Let us note q, the space-like difference p’ — p; since p a loa 
the only scalar invariant is g?. Allowing for the most general decomposition of O* on the basis of the 
16 Dirac matrices, and using Lorentz invariance, conservation, and hermiticity, we find 


ican rn emit app Meee Get 8 Fagd* pmugty | is 3-203 
u(p')O*(p', pu! (p) = #PXp')| y*F \(q*) + i ~ F.(q7) + 7 — ie cee ) 


with F,(q’) being real form factors —functions of the square momentum transfer q’. Quantizing spin 
along a fixed axis we derive from Eqs. (3-178) and (3-199) that symmetry under parity implies 


<p’, Bl i*(x) |p, o> = <B', Bliu(%) |B, o> 


Since u(p) = y°u'(p) this means O*(p’, p) = 7°O,(p’, p)y°, the consequence of which is F3(q7) = 
—F3(q’). Thus parity conservation alone yields F; = 0. However, it could happen that parity is 
violated by some interaction (in practice, weak interactions). 

It was long believed that parity times charge conjugation remains a symmetry of all interactions. 
From the PCT theorem this meant that T was also a symmetry. This imposes F*(q?) = e;F\(q?) with 
& = €2 = —é3; = 1. From hermiticity we know that the F functions are real. Therefore again F; = 0. 
It is known, however, that T is not a symmetry of all interactions, since the discovery of its violation 
by Fitch and Cronin in the neutral K meson decays. 

As an example let us assume that j#(x) is the electromagnetic current of the interacting theory. 
By considering two normalized states close to rest, the reader will have no difficulty in showing that 
the following correspondence holds: 


eF, (0) = charge Q 


— [ F,(0) + F2(0)] = magnetic moment » (nonrelativistic magnetic coupling — po - B) 
m 


ae 


om F3(0) = electric dipole moment d_ (nonrelativistic electric coupling — da - E). 
m 


Extensive measurements of nucleon-form factors have been performed using electron scattering off 
hydrogen and deuterium targets. For this case F; and F; are now known in a wide range of q’. 
However, up to now F3 has not shown up. In particular, bounds for the static electric dipole moment 
of the neutron have been reported (Ramsey) showing d neutron/e < 10773 cm. The magnitude of this 
electric dipole moment is a sensitive test of the mechanism of CP violation. 

For a charged particle, the total gyromagnetic ratio is 2[F,(0) + F2(0)]/F,(0). For the proton, 
for instance (with the convention e = |e|), F,(0) = 1, F2(0) = 1.79, and the gyromagnetic ratio is 
5.58. In the literature we often find the Sachs combinations Gg; = F, + (q?/4m?)F, and Gy = F, + F 
which are more convenient for displaying the experimental data on scattering. These form factors 
extend analytically to positive (time-like) values of q? where they are related to the process photon > 
fermion + antifermion. 

In the case of scalar (or pseudoscalar) particles the corresponding matrix element of the current 
has the form 

! tod 
<P']j*(x) |p> = et" * eer F(q’) (3-204) 

We close this section with an algebraic exercise. It deals with quartic products of Dirac fields 
organized in covariant scalar densities as they occur for instance in the effective Fermi lagrangian 
for low-energy weak interactions. Its purpose is to examine the effect of rearranging the order in 
which these fields are coupled together to obtain a Lorentz scalar, the basic blocks being the quadratic 
quantities occurring in table (3-199). This is the Fierz theorem. Denote collectively by I? the 16 Dirac 
matrices 
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Ts I¢ ie T4 Tp 


i 
2 


(3-205) 


bye. + ot” = ~ [%, y”] ypryt iy 


_ Set P, = (1) "' and observe that y°(I*)'y° = I in such a way that the forms py are hermitian. 
With these notations 


Tr "ls = 46%, espe fics IO (3-206) 
Any 4 x 4 matrix X has an expansion 
Ka hrs (Xe) = $0, Tr (XT*) 


By identifying the coefficient of X,, we find the following identity where Latin indices run from 
1to 4: 


SaaOnb = aT, aT Sp (3-207) 
Applying this to TI, we have 
I'l s=p%,I” p%s, =aTrIT,!, (3-208) 


Writing , = e,1* where c, = +1, it is readily seen that p.,, = €,9%,, Is invariant by cyclic permutation 
and that given the values of two of its indices it is nonvanishing for only one value of the third. 
The only nonvanishing terms are 


P"ps = psp = Op 
and PY,T,-V, = —PV,V.Toe = UG onGve ~ GuoGve) 
PV TpgAy — PT; VA, — "uvoo 
PY.AP = —PAV,P = —"9uv 
Pr.,A,A, = ~PA,T,A, = 1GueIvo — JupIve) (3-209) 
PT RPTL P 1, Tye HBYDO 
PT ToeTe = *LGuplGrJea — JveIro) + Jua(GveJ ox — YveFep) 
+ GuclGvoJ oe — JvpGor) + GuelJroIvp — Ive9 pr)] 
Consider now the five Lorentz scalars 
s = u(4)u(2)a(3)u(1) 
v = ui(4)y*u(2)u(3)y,u(1) 
t = 4u(4)o"'u(2)u(3)o,u(1) (3-210) 
a = u(4)y* y4u(2)a(3)y,y°u(1) 
p = i1(4)y°u(2)u(3)y°u(1) 
Here u(1) denotes u(p;, 1), the spinor with momentum p, and polarization a. We shall use the 


symbol s(4,2; 3, 1) to characterize the way in which the Dirac indices are contracted, and similarly 
for the other amplitudes. Any one of these quantities may be written as 


b(4, 2; 3; 1) = Hia,(3)ita,(4) To ,0-T 2,aja,Ua,(1 ua,(2) 


How are these quantities related to the corresponding ones where 4 is contracted with 1 and 3 with 2? 
The theorem states that there exists a numerical 5 x 5 matrix F relating the two sets of quantities. 
This matrix will then necessarily be equal to its inverse. Before proceeding, let us remark that we 
could ask a similar question in x space with operators of the type » Walx)ba(x)ha(x)h 1(x):. The 
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relevant matrix would then be — F due to Fermi field anticommutation. In any of the five quantities 
b(4, 2; 3, 1) we can use Eq. (3-207) to rewrite 


Ua(2)Ua (1) = Saya, 50,a,Ha,(2)Ua (1) = 20 p,0,0,0 8 2,ta,(2)ua,(1) 
Thus, with the help of (3-208), 
b(4, 2; 3, 1) = 4140,(4)ha(3)PSsa.F pas F a,a30;1 4,2, Ma,(2)ua,(1) 
= $p%p,paP*u(4)Pu(1)u(3)T su(2) 


It remains to use (3-209) to obtain the desired result: 


1 1 1 1 1 
4 =2 © @ =< 
(4,2;3,1)=4 of So ae. ae (4, 1; 3,2) 


s 
v 
t 
a 
Pp -1 1 -1 1 


The reader may check that F? = 1 and diagonalize the matrix. What is the behavior of the various 
amplitudes under the discrete symmetries? 


NOTES 


The physical meaning of the quantum commutation relations in electrodynamics 
is analyzed in N. Bohr and L. Rosenfeld, Kgl. Danske Videnskab. Selsk. Mat.-Fys. 
Medd., vol. 12, p. 8, 1933, and Phys. Rev., vol. 78, p. 794, 1950. For limits on the 
photon mass see A. S. Goldhaber and M. M. Nieto, Rev. Mod. Phys., vol. 43, p. 277, 
1971. Macroscopic effects of vacuum fluctuations were considered by H. B. G. 
Casimir, Proc. Kon. Ned. Akad. Wetenschap., ser. B, vol. 51, p. 793, 1948. See also 
M. Fierz, Helv. Phys. Acta, vol. 33, p. 855, 1960; T. M. Boyer, Annals of Physics 
(New York), vol. 56, p. 474, 1970; and R. Balian and B. Duplantier, Annals of 
Physics (New York), vol. 112, p. 165, 1978. Experimental evidence is discussed by 
M. J. Sparnaay, Physica, vol. 24, p. 751, 1958. A study on Van der Waals forces is 
found in the work of G. Feinberg and J. Sucher, Phys. Rev., ser. A, vol. 2, p. 2395, 
1970. Quantum field theory at finite temperature is presented, for instance, in’ 
L. Dolan and R. Jackiw, Phys. Rev., ser. D, vol. 9, p. 3320, 1974. 

For a general axiomatic formulation of field theory and a derivation of some 
fundamental properties, see R. F. Streater and A. S. Wightman, “PCT, Spin and 
Statistics, and All That,” Benjamin, New York, 1964, and R. Jost, “The General 
Theory of Quantized Fields,” AMS, Providence, R.L., 1965. 


CHAPTER 


FOUR 
INTERACTION WITH AN EXTERNAL FIELD 


The interaction with an external field yields a simple example of a dynamical 
system. We introduce the important concepts of interaction representation and 
Wick’s identities. Applications include the radiation of a classical source and the 
infrared catastrophe, the physical counterpart of which in the fermionic case 1s 
the process of pair creation by a c-number electromagnetic field. 


4-1 QUANTIZED ELECTROMAGNETIC FIELD 
INTERACTING WITH A CLASSICAL SOURCE 


4-1-1 Emission Probabilities 


We shall first consider the interaction of the quantized electromagnetic field 
with an external given source. The physical problem is thus the emission or 
absorption of photons by a classical conserved current j,(x): 


Oy j"(x) = 0 (4-1) 

From the electromagnetic action 
I=- [at AF yF" + j,A") (4-2) 

we derive the equation of motion 
C= a\4"—po-A = j* (4-3) 
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As in the free case, we encounter a difficulty in the quantization of this field, 
which may be solved by the addition to the action of the term 


~3 | ate(o-ay 


In this gauge, the equation of motion reduces to 
DA* = j* (4-4) 


The operators A,(x) act in a space with an indefinite metric and satisfy the 
same commutation relations as in the free case 


Aste x), A(t, y)| = — Jyyid?(x a y) (4-5) 


If we assume that the solution of (4-4) is defined on the same Fock space as the 
free field, the relation between the free and interacting fields amounts to a canonical 
transformation. In quantum mechanics with a finite number of degrees of freedom, 
this would be implemented by a unitary transformation. However, for an infinite 
number of degrees of freedom, there may be a difficulty. We illustrate this point 
with a simple model. 

Let a finite quantum system be composed of N half-integer spins at the 
vertices of a three-dimensional cubic lattice. They may carry, for instance, a 
magnetic moment. The observables are the 3N operators o;(n), o2({n), o3(n), n 
being the site index. The states are linear combinations of the eigenvectors of 
the o3, denoted by |+ +::-+). They are generated by the action of the o- 
upon the state |0> =|+ +--- +. Consider now the rotated operators 


T1(n) = o,(n) cos 6 — 63(n) sin 8 

T2(n) = 62(n) 

T3(n) = o,(n) sin 8 + a3(n) cos 6 
Clearly, 

come peal 
Tq(n) = Xp | -: ~o 30) | o4(n) exp I > 2, Falp) 
2 p= 1 2 p=1 

This means that the operators o and t which generate the same algebra are 
unitarily equivalent. Let |@> denote the new ground state of the 7: 


|@> = exp |-i5 3 o2(0)|0> 


It is easy to compute the scalar product <0|@> = (cos 6/2)". Assume that in the 
limit N — oo, the Hilbert space of states is constructed from the ground state 
|0> =|+ +--+ +> by the action of a finite number of creation operators o_ and 
Cauchy completion. We may again rotate the operators o into t and define the 
state |9> in analogy with |0>. We may then seek a unitary transformation U(6) 
which would implement this rotation: 


|8>=UO)|0> x(n) = UPo,(n) UT) 
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Such a unitary operator clearly does not exist when N > oo. Any scalar product 
of a rotated state with a nonrotated one vanishes in this limit: for instance, 
<0|@> = (cos 6/2)" yaz 0. The moral is that, in the case of an infinite number of 
degrees of freedom, physically equivalent observables, i.e., realizing the same 
algebra of commutation relations, etc., are not necessarily unitarily equivalent. 
We have to keep this in mind when we observe that our field A, satisfies the 
free commutation relations (4-5). 

Returning to the field equation (4-4), we may write a particular c-number 
solution 


At(x) = fay G(x — y)j*) 
expressed in terms of some Green function G(x — y) of the d’Alembertian operator 
Oi: 
D1.G(x — y) = 64(x — y) 
The general solution of Eq. (4-4) is therefore 


A, (x) = Aj?'(x) + | d*y G(x — y)j,ly) (4-6) 


where A is a quantum free field. The precise form of Green function is specified 
by the boundary conditions. Assume that the current j”(y) has been switched on 
adiabatically on a finite time interval. Using the advanced and retarded Green 
functions of Eq. (1-170) : 


1 A e 
Gu t= — aes | deo 


—ip-x 


eos <12) 
2n 


we may write 


A*(x) = Afn(x) + [ay Gre(x — yi") 
= At,.(x) + [ety Gaar(x — y)i*(y) (4-7) 


The free fields A¢, and A#4,, describe the photon field before and after its inter- 
action with the current j. In a formal sense, we have 


lim A*(x) = Af,(x) 


(4-8) 
lim Ac) = As, (x) 


The precise mathematical meaning of these expressions depends on the source j. 
We may require at least that the matrix elements of some local average of the 
fields between normalized states satisfy these relations. This weak limit relies 
crucially on the assumption of an adiabatic vanishing of the source as |t| + 00. 
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This is often not actually realized and therefore Eq. (4-8) will have to be taken 
with a grain of salt. 

The construction takes place in a given Hilbert space, the Fock space of 
incoming photons, for instance. The vacuum is then the state annihilated by the 
operators a'?)(k), and we try to find the canonical transformation represented by 
a unitary operator § which connects the in- and out-fields, that is, 


Abu(x) = S~* Afa(x)S (4-9) 
and in- and out-states according to 
lout) = San — Stn) 
or |in> = S |out> 


(4-10) 


Suppose that at time t = —oo the system is in a definite state, the vacuum for 
instance, i.e., contains no (physical) photon. The final state has some computable 
probability to contain zero, one, two, etc., emitted photons. For example, the 
probability amplitude to remain in the ground state is 


<0 out|0 in> = <0 in|S|0 in> = <0 out|S|0 out> 


For this interpretation to make sense, we should verify that the probability 
Po = |<0 out|0 in|? is less than one. The probabilities p,, p2,..., the final polariza- 
tions, and the angular distributions are computable in an analogous way. There- 
fore, the operator S contains all the information about the final state. 

We now turn to its determination. Equation (4-7) implies that 


Abur(x) == Af, (x) aR [ey pGeelX — y) od Grats <A y) i") 
(4-11) 
= Afn(x) + Aa(x) 
The second term of the right-hand side of (4-11) is, of course, a solution of the 
homogeneous equation, and is nothing but the classical field A4, radiated by ' 


the current j (1-206). The combination G‘~) = G,., — Gag, has already been en- 
countered in Chap. 1 [see Eq. (1-173)]: 


G(x) = Gre(x) — Gaav(x) = On: | e~ P** 8(p9)(p?) 
= - €(x°) d(x?) = — A(x) 


It coincides up to a sign with the commutator A of scalar massless free fields 
(3-56). This enables us to rewrite (4-11) as 


Abui(x) = S7* Ain(x)S = Afn(x) — i [ey [Ain(x), Ain(y) JO] (4-12) 
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This is reminiscent of the formula (2-81): 
e“Be-4=B+[A, B]+35 (LA, [A, B]] + - 


Indeed, only the first two terms of the right-hand side do not vanish, since A 
and B represent free fields, the commutator of which is a c number. Therefore, 
we may write 


S = exp E | atx Ain(x) “(0 


= exp E [atx Aout(x) *j 09 | 


This form does satisfy all the conditions, including unitarity in the indefinite 
metric space. Only a c-number phase, depending possibly on j, is still arbitrary 
in S; the latter does not affect the physical quantities. 

It is convenient to rewrite S in normal order. We decompose Af, as a sum 
of annihilation A‘‘*) and creation A¥‘~) operators. We observe that 0, AM) = 
6, A4\t), and therefore S must leave the positive metric physical subspace invariant. 
The commutator of A““~ and A”*? is a c-number function: 


(4-13) 


e 9 9(k°)5(k?) (4-14) 


4 
[AK (x), Ax Y(y)] _ gt’ | 


Therefore we may use the identity 
e4 eB — gA+BtIA, By/2 (4-15) 


which is valid whenever [ A, [A, B]} = [B,[A, B]] = 0, and write the normal form 
of S: 


S = exp E [ey Ai,(y) “i0)) 
= exp | - i ey A‘ (y) 10 | exp E ey Ay {) 0 (4-16) 
x exp i d*y [AS (x) - f(x), Air?) i on 
We introduce the Fourier transform of the current j(x): 
Jk) — | atin ee (4-17) 


The real character of j and its conservation law are expressed by 
J*(—k) = J**(k) and k,J*(k) = 9 (4-18) 
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The exponent of the last term in Eq. (4-16) is 
1 CP ha at) ; (+) . 1 a*k = J* ky) Hk 
5] d xd y [AG (x0) > flea iA (vy) JO = 5 15 2k°(2n)3 ( }e ( )| po ahi 


As expected, only the Fourier components of the source corresponding to light- 
like arguments do contribute. Moreover, for k? = 0, we may decompose 
J*k) = k*J,(k) + Ji(k) (4-19) 


where J,(k) is a number and Ji(k) is a space-like vector orthogonal to k. For 
instance, if k = (kk) , we introduce the space-like four-vectors €; = (0, e1), 2 = 
(0, e2) with ej = e3 = ‘) e,°e@) =e,°k =e.:k =0. We may then choose Jii(k) = 

— js =1,2 Ji(k)et(k) with J(k) = ¢;- J(k). Using this decomposition, it is easy to 
see that 


J*(k) + I(k) = TECK) + Iuelk) = —[Ja(k)|? + |J2(K)|7] 


In other words, only transverse components contribute to the last term of (4-16): 


S = exp | - [ats A Ge) 7) exp E [as Ay ie) 7) 
x exp |- 5 | dk [|J.(k)|? + 00/71} (4-20) 


The probabilities of emission are expressed in terms of these transverse com- 
ponents. Indeed, for po we find 


Po = |<O out|0in>|? = |<Oin|S|0 in>|? 


To compute the probability p, corresponding to the emission of n photons when 
neither the momenta nor the polarizations are observed, we recall that an n‘ 
photon state is expressed by 


[Ki A1, koda,...skndnd = al(K1) +++ atA(K,) 10> 
with the normalization 
(hyd... dn RA ent 
= D9 a. 95, Sax, (20)° 2k1 5°(Ky — Kp,)--- (22)? 2k2 53(k, — Kp.) 
where the sum runs over permutations. 


Owing to Bose statistics, the projector over the n photon states reads 


1 a ne 
i= | a, dy ps ; lic As... Riga) <Jegianeee ean (4-22) 
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We have therefore to consider the matrix element 


Sh Assess Rare out|0 in> = 7 he e/a Ue in|S|0 in> 
exp {4 | aktls.09) + jss0081} 


x Slates Kade in| exp - L [ax Af, (x) 40s |o in) 


(4-23) 
Now 


{as Al—)(x) + io) = | ak y at (ke (k) « I(k) 


= | dk at (k) J,(k) 


A=1,2 


Current conservation has again limited the sum to the transverse degrees of 
freedom; if we include the “longitudinal” and “scalar” photons in the projector 
P,,, their contribution cancels automatically. In Eq. (4-23), the term with n creation 
operators gives the only contribution: 
(ts ae : : a 
ky Alyse knAn| {ea aa dGn a’? (q1)Jo,(41) ies a‘ Tn) Jo,(Qn) |0> 

= (—i)"Ja,(k1)°**Sa(Kn) = (4-24) 


The factor 1/n! has disappeared, since there were n! equal terms. The probability 
Pn reads 


pa— <0 in| P,|0in> 


7 2} |aatlor? (of exp - [ak tise - J2tt")} (4-25) 


Defining 
i= | aetna? 4 | Jo(k)|7] (4-26) 
we obtain the Poisson distribution 
re 
=e "— 4-27 
The distribution is normalized: 


» Pr = 1 


0 
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and the average number of emitted photons is n: 


fe @) 


Y npa =i 
6) 


The Poisson distribution (4-27) reflects the statistical independence of the 
emission of successive photons, which may also be seen on the factorized structure 
of the matrix element (4-24). On the other hand, let us examine the nature of 
the final state. We start at time —% from the vacuum state |0 in> which is an 
eigenvector of the annihilation part AM(*)(x): 


Ai*)(x) [0 in> = 0 


We may replace the time evolution of the state by that of the operator, and 
thus consider 


Att*)(x) [0 in> = S~ * AR (Ox)S [0 in> = AB*(x)|0 in> (4-28) 


Here A#(*) is the positive frequency part of the classical field involved in Eq. 
(4-11): 


i ies. | 
All (x) = sip (Pk zo eae (K) [pony (4-29) 
Hence we have 
<0 in| Abui(x)|0 in> = ABQ) = fe G(x ~ y)iM(y) (4-30) 


Equation (4-28) means that the final state is a coherent state (compare Chap. 
3). This is not contradictory with the Poisson distribution of the emission. Indeed, 
there is a deep connection between this property of the state to be an eigenstate 
of the operator A‘? and the statistical independence of successive emissions. In 
a loose sense, neglecting quantum fluctuations, the field in the final state is A,)(x). 
This is what is expressed by Eq. (4-30). 


4-1-2 Emitted Energy and the Infrared Catastrophe 
To obtain an average value of the energy emitted in the process we consider 
& = {0 in| H(A,,,)|0 in> 
= (0 in|S” * H(Aj,)S|0 in> 


= (0 in|s~? [a ko ale"(kyalP(k)S |0 in) (4-31) 
A=1,2 


and observe that the contributions of unphysical degrees of freedom cancel in the 
sum as expected. From Eq. (4-11), it follows that 


aPikj=S “aig th)s = a"(k)—iytk) A=he 
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and thus 


Bs . 1a 
€ = | a k°L| i(k) |? + |J2(k)|7] =| []Ji(k)|? + |J2(k)|?7] (4-32) 


This result agrees with the classical theory! Indeed, the radiated electromagnetic 
field F”’ corresponding to the potential A“, is 
d*k 


Pas) "OA, es, = |S 


£(k°)5(k?) e~ * *[k#I"(k) — k*J#(k)] 


The energy density is 


1 3 
w—=5) > e+ F Crtort 


and its integral, averaged over a long period of time, reads 


oa 3 
é.= [as ee) = [ak sin| > | k°I%k) — k°F(K)|? 


=1 


+ > Jest) — KH | 


l<a<Bs 


Using the decomposition (4-19) on the light cone, the longitudinal component 
of J does not contribute, and we get 


6. = {a k° [| J1(k)|? + | Jo(k)|7] (4-33) 


in agreement with (4-32). The interpretation of this result and of the one provided 
by Eq. (4-26) is that the number of photons emitted in a phase space element 
dk is 


dn = h~* dk[|Jx(k)|? + |Jo(®)|7] (4-34) 
and their energy is 
d& = hk° dn (4-35) 


In particular, if a finite amount of energy is emitted at low frequency, the 
number of soft photons di = d@/hk® tends to blow up. Typically, it may happen 
for certain currents J that fdé& is finite, whereas J d&/hk° diverges. This is the 
famous infrared catastrophe, already encountered in Chap. 1. We saw there that 
when a charge is suddenly accelerated from momentum p; to momentum p,, 


it creates a current 
sD pir ) 
2 — e( _ 
( Py kK pick 


As a consequence, it radiates a flow of photons, of finite total energy & ~ § dk°. 
Unfortunately, the total number of particles diverges as 1 ~ J dk°/hk°. Therefore 
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it is impossible to abstract a charged particle from its radiation field. However, 
as stressed in Chap. 1, it is meaningless to count the number of emitted soft 
photons, and the only physically measurable quantity is the emitted energy. 

Mathematically, the situation is really catastrophic. Indeed, when n— oo, 
we learn from Eqs. (4-21) and (4-26) that 


|<O out|0 in>| = e~”? +0 


Every matrix element between in- and out-states vanishes. Clearly, it is impossible 
to construct the “out” Fock space from the “in” space, nor to find a unitary 
operator S. This is exactly the same situation as in the simple model discussed 
at the beginning of this section. For a system with an infinite number of degrees 
of freedom, and under certain circumstances (here n = ~), inequivalent repre- 
sentations of the canonical commutation relations may exist. It is no wonder 
that we get into trouble here, since we try to describe the final states as super- 
positions of states of a finite number of photons, while we know that their 
actual number is infinite. 

Physically, we may cut off a part of phase space, i.e., we decide to observe 
only final photons within a certain range of energy momentum. This indeed 
corresponds to an experimental limitation. Every photon detector has a given 
finite resolution, and photons of energy lower than this resolution are therefore 
unobservable. Let R be the unobserved region of phase and Ce its complement. 
The total probability of emission in R only, i.e., to detect no photon 


pe= )i p=» ene e7@*®) = exp(—iic,) 
is nonvanishing since 
fic, = | dk [ |J1(k)|? + | Jo(k)|?] 


is finite, while each term of the sum vanishes as 7 > oo. The observed radiated 
energy &(Cr) = & — &(R) is as close to & as desired, for small enough R, and 
the probability of detecting at least a photon with momentum outside R, 
| — exp(—jic,), is finite. 


There is an alternative possibility to avoid the slippery subtleties of the previous treatment of infrared 
divergences, at least of its mathematical difficulties. We give the photon a small mass y and use the 
Stueckelberg gauge of Chap. 3 to quantize such a field. This will cut off the low-energy region 
since now k® > y and therefore remove the infrared divergence. However, we have to verify that the 
extra degrees of freedom do not introduce any spurious effect, namely, that observable quantities 
are not affected in the limit p — 0. 

We thus consider a massive photon field coupled to a conserved current. In the massless case, 
we recall that the indefinite metric state has played no role; only transverse degrees of freedom 
have been excited. Similarly, here, in the Stueckelberg gauge, the conservation law of the current 
implies that only the transverse field 


A 
Ag = An + 7 O,(8° A) 
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is coupled to the current 
| Azoore9 ox = | scarey d*x 


Therefore the negative metric states disappear. However, the longitudinal polarization state still 
contributes. We may compute #7, for instance, from Eqs. (4-16) and (3-148): 


a | d*x d*y [Al (x) + j(x), AD (y) -j(y)] 


aek 
aes | | ea - 7* 
|e J(k) J CK) po ye + p23)? (4-36) 
Typically for an accelerated charge, J-J* ~ 1 kj, as we recalled above. Cutting off integrals at some 
|k| = kmax, We get a potentially divergent 7: 


_ —_ wae Kmax 
* (k? aE ee n u 


while the emitted energy 
k ote 
= mm dk k Tt 
é~ ma lie a ae 
|, 4 we ao: 
has a smooth behavior as : +0. We also find that the contribution #, of longitudinal photons of 
given momentum k is vanishingly small with respect to that of transverse photons jir: 
my _ (al? — Wel? 2 (ol 
fir [Jre|? ||? | Sue? 


(4-37) 


where current conservation has been used to write |J,! = (ko/|k|)|Jo|. The introduction of this small 
photon mass has provided a convenient regularization of the infrared divergence. However, the 
existence of a third state of polarization might modify the blackbody radiation spectrum by a factor 
3/2. This difficulty is bypassed by the assumption that the equilibrium time for the third mode may 
be so large that the effect is totally unobservable. 


It must be clear from this discussion that the main features of the infrared 
divergences of quantum electrodynamics are essentially classical, and depend on 
the nature of the external current and on the experimental resolution. However, 
quantum effects manifest themselves, e.g., in the fluctuation of the number of 
emitted photons. Assuming n to be finite, we compute 


Ant=n—-nr=n 


and therefore 


oe 1 (4-38) 
n 


These fluctuations are very small if 7 is large. Similarly, the energy fluctuations 
read 


Ae? Se {a k& [|J1(K)|? + [Jo(k)|7] (4-39) 
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If the radiated frequencies are strongly peaked about the value <k°>, we have 
AE ~ (hXk>8)'? and = & ~ KKk®) 


and thus, as expected, 


(4-40) 


4-1-3 Induced Absorption and Emission 


We can use the explicit forms (4-13) and (4-16) obtained for the S$ matrix to discuss briefly some 
interesting situations. Namely, we may wonder how the presence of photons in the initial state 
affects the radiation of the source j. 

We have seen in Eq. (4-28) that a classical current creates a coherent state from the vacuum. 
This coherent state may in turn be considered as the initial state for a second source j. In other 
words, the first source j;,, which generates the initial state 


Vine? — exp |- i fas Ain(x) “in J in) (4-41) 


is assumed to be well separated from the second source j(x). The projection of the final state on out- 
states reads 


<b out| jine> = <b in| exp i {as Ain(x) 7) exp |- i | d*y Ain(y) “iad | |0 in) 
= exp | = ; {les d*y j,(x) G(x — fi) 


x Cbin| exp \-1 fey Ain(y)* Liine(y) +00} |0 in) (4-42) 


The first factor in the last expression is a pure phase, independent of the state b, and thus unobservable. 
The second term tells us that the final state is produced by the sum jing + j. The total number 
Ayo, Of photons is obtained by substituting J + J;,. for J in Eq. (4-26), and the (average) number of 
radiated photons, defined as the difference fi,,, — flinc, is 


Arad = Neot = Rinc = | a Jic(k) |? te 2 Re | a J,.(k) : Frcur(k) (4-43) 


The first term on the right-hand side of (4-43) is the number of photons emitted by the source j 
alone, while the second one is a typical interference term, which represents the stimulated absorp- 
tion or emission. We observe that this term is linear in Jine (OF in Ff.) and that terms corresponding 
to different frequencies are decoupled. This reflects again the independence of the various modes. 
The reader may compute the radiated energy in the presence of j;,, and observe again this interference 
phenomenon. On the one hand, we know that two classical sources interfere and energy interferences 
must imply interferences in photon number since da(k) = dé (k\/hk®. On the other hand, in view of the 
stochastic nature of the emission, it is rather surprising to find an interference in the number of 
emitted photons. This shows clearly the limitation of this stochastic interpretation. Emission and 
absorption are connected since an emitting source may also absorb photons. Then the equality 
Niot = Nine + Aj would mean the impossibility of absorption! 

It is instructive to compare this situation with the case where the initial state has a definite 
number of photons. For simplicity, we assume all photons to be in the same mode: 
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1 


|Rlinc in> = 


ine! 


| {ak face | \0 in) (4-44) 
A=1,2 
The function f, is normalized: 


J4#, vor = 


and it is understood that f(k) is peaked about a mean value. We shall only compute the average 
number of photons in the final state 


rot = Aline in | Nout| Ninc in> 


where Nout = [a at (k)a\)(k) = St | dk ¥ ald t(k)ald(k)S 
A=1,2 Aa 
= {az DY [al?*(k) — i(k) [al (k) + WF (K)] (4-45) 


We only count the number of transverse photons, since longitudinal and scalar photons are absent 
in the initial state and are not emitted. Thus 


Nor = Rinckts | a&t\s.00) ats | Jo(k)|?] 
+ Tinc| [a b ia)*(k)J,(k) + hermitian conjugate [in 
= Fine + | aR [00]? + [2/1 (4-46) 


The average number of radiated photons is the same as in the absence of photons in the initial state! 
On the average, there is no stimulated absorption or emission. But this does not mean that the 
emission probabilities are unchanged. Indeed, we may compute the probability of finding m photons 
in the final state. An easy calculation leads to 


i OFiine OFline a 
—f{exp—(nj+ |dk]z Y J#(k)f,(k) + complex conjugate 
2 


Prom = = - = 
bi RinctMm! OzTine OZMine =1,2 


x | fae > Lc + 007 |) 
A=1,2 


For a weak source J, we may keep the lowest order in J. The only nonvanishing transition prob- 
abilities to this order are 


(4-47) 


2-2 


Pracetcos = | a Y |Ialk)|? + fine] Sak Y F*(K)IAK)]? 
A=1,2 A=1,2 
Pir... = 1 | ak Y |Ja(k)|? — 2ine| Sak Yo Fk)? (4-48) 
A=1,2 : A=1,2 


Piige*Fine1 = Mine| SOK Yo fF(K)\Ia(k)|? = 1 — Pige-iige — Pine *fine* 1 


A=1,2 


The first expression may be compared with 


pons | a = | Ja(k)|? 
A=1,2 
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The presence of photons in the initial state has raised the probability of emission. This is the basic 
result of stimulated emission. From Eqs. (4-48) it is also easy to check that the average number of 
radiated photons is still nj. 


4-1-4 S Matrix and Evolution Operator 


The derivation of the S matrix in Eq. (4-13) relied on the assumption that the 
source was not an independent dynamical variable, i.e., that its time evolution 
was given once and for all. This method has left an arbitrary, source-dependent 
phase in S. It is therefore appropriate to set up now the general formalism and 
to compare its results in this specific case. The idea is to construct the operator 
that realizes the time-dependent canonical transformation relating the interacting 
field A” to the incoming field A‘: 


A*(t, x) = U~ *(t) A(t, x) U(t) (4-49) 


As already discussed and expressed in Eq. (4-8), A#, is some weak limit of A” 
as t+ — oo, and, consequently, in that limit U reduces to the identity 


im U(t)=1 (4-50) 


In the canonical quantization, we deal with the operators A(r) and x(t) that 
satisfy 


x 


5 A(t, x) = i[H(0, A(t, x)] (4-51) 


and an analogous equation for x(t). Here H(t) = H[ A(t), 2(t), j(t)] denotes the 
hamiltonian. Similarly, the in-field A;, satisfies 


0 
ot Ainge x) = iLHo’, Ai, (t, x) | 


where Hi" is the time-independent free hamiltonian [compare Eq. (3-120)] ex- 
pressed in terms of the in-creation and annihilation operators. Let us derive the 
time evolution equation for the operator U(t). Unitarity requires that 


d - a 
E vw |u 1(f) + uO U-4(t) =0 
Moreover, it follows from Eq. (4-49) that 
U(t)H(A(), x(2), (Q)U-*() = H(Ain(0), tin), (0) 


Now 


0 0 
3 Ainlt, 9) = 5 [UMA NU“) 


<2 U- 1(t) Ain(t, x) a iU(t) [ H(A(0), mt), j(t)), A(t, x) | Lilie 1(r) 


d 
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+ Ault, XU) £ U1 


dU 
_ U"'(t) + 1H (Ain(0), min(t), j(0)), Ain(t, »)| 


= iLHé, Ante x)] 


with an analogous equation for z;,. We conclude that the operator 


dU 
a U~*(@) + iLH(Ain(?), tin(t) (0) — HE] 


commutes with every “in” operator and is thus a c number. This number will 
not contribute to normalized matrix elements of U (this point will be justified 
later), and we discard it hereafter. Then U satisfies the evolution equation 


dU 


i = LH (Aia(0), tial), () — HE] 


= Hint(Ain(t), Tin(t), j(O)U = A,(t)U (4-52) 


In our case Hi? = H(A,,(t), Tin(t), J)| j=0; and the interaction hamiltonian H,(t) 
vanishes with j. The evolution dictated by the operator U(t) will thus be referred 
to as the interaction representation. Notice that H;(t) depends on time both 
through the time dependence of j(t) and of Aj;,(t) and 7;,(t). Equation (4-52) 
may be solved by iteration of the corresponding integral equation: 


t 


dt, Hy(ts)U(ts) (4-53) 


vig=1-i{ 


where the boundary condition (4-50) has been taken into account, with the 
result that 


=O 


U(t) =[—-ji { dt; Hy{t;) + (—i)? { dt, [- dt H,(t,)H,(t2) a 


imme 6) 


+a |) at, |" dts [dy atts) Hil) + (4-54) 


In these expressions, the ordering of the operators is important: it is related to the 
ordering of their time arguments. We are led to the general definitions of the 
time-ordered product of n operators (T product, for short): 


T[Ai(t1)°+* An(te)] = ¥ O(tp,, tp,,.--, tr,)epAp,(te,)*** Ap,(te,) (4-55) 


where the sum runs over all permutations P, the 6 function enforces the condition 


Ge tp, = = te, 
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and &p denotes the signature of the fermion operators permutation involved in 
this product. (The latter point is of no use now, but will serve us soon.) Here, the 
T product is symmetric and we may write 


U(t) = (ir { dty | | dt | “dty T[Hity)-- Hilta)] 
0 400 108) —100 
- y = { dt, { dt, | dt, TLH,(t1)-*- Hi(tn) J 
n=0 ~ a - ow =x 
= T exp | -: { dt’ He | (4-56) 


The last expression is symbolic. Its meaning is given by the previous line. How- 
ever, T products of exponentials enjoy an important property, which may be 
proved easily. For t; < tz < ts, 


3 ty t2 
T exp \ di O(t)= T exp \ dt O(t) T exp | dt O(t) 
t t 


If H,(t) +0 as t > — oo, the operator U(t) of Eq. (4-56) does satisfy the require- 
ment (4-50). This is what happens in our case if the current is adiabatically 
switched off in the remote past. The S matrix is defined as the limit 


S= jim U(t) = T exp E | dt’ m0 | (4-57) 


— ce 


In the present case, H, = § j(x)* Ain(x) d?x, and we get 


S = T exp E [ax iulx) Ait | (4-58) 


In this expression, the T symbol contains the only reference to a definite time 
coordinate. This looks noncovariant, but owing to local commutativity the density 
H (x) = j*(x)A,(x) commutes with #;(y) if (x — y)? < 0. Therefore a change of 
frame does not modify the expression of the T product. In every theory without 
derivative coupling we have 


H(t) = [ex i [es L(x) (4-59) 


where it is understood that x° stands for ¢ and that all operators are in-operators. 
Finally, the general “covariant” expression for the § matrix reads 


S = T exp [ [asx ey ic | (4-60) 


The quotation marks recall that our previous argument of locality does not 
rule out the occurrence of short-distance, a priori noncovariant singularities, 
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when two arguments coincide. The treatment of these singularities is the subject 
of renormalization theory. 

We want to compare the general expression [(4-58), (4-60)] with our previous 
result (4-13). For the two quantities to agree, the difference should be an un- 
‘observable phase. This is what we are going to check, using the property of free 
fields to have a c-cnumber commutator. We give here a simple heuristic proof 
and let the reader elaborate it further. At any rate, this property is related to the 
important Wick theorem, which will soon be examined in detail. Let us write the 
S matrix as a limit 


LY pied et. 8) 
t,+ +00 


t 
S= lim T exp E (" dt 1140 | 
t; 


divide the interval t, — t; into N equal ones At = (ty — ti)/N, and denote by 
t, = t; + [(2k — 1)/2] At (k = 1,...,N) the midpoint of each one. In the large N 
limit the T product is approximated by 


t 
Texp |- | if dt Hi = e— iAt Hilt) e~ tAt Hilty -1) Sau e~ tAt Ai(ts) 


t; 


Since the commutator [H,(t), H,(t’)] is a c number, we use the identity (4-15) 
to rewrite this product as 


N 
exp - iAt) Ay(t,)—2At? > (Arts) rice} 
rr 1<k<I<N 


Letting N go to infinity and taking then the limits t; + — 00, ty + 00, we find 
S = exp | - {a Ais) 09 exp \-4 [atx d*y @(x° — y°) 


x [Ain(x) -j), Ainly) ion} (4-61) 


The operators Aj,(x)*j(x) and Aj,(y)-j(y) are hermitian free fields ; therefore their 
commutator is an imaginary c number. Indeed, we know the expression of the 
retarded commutator [(1-169) and (1-170)]: 


O(x° — y)[A%(x), Ala(y)] = igt” O(x® — yG(x — y) = igh” Greil — y) 


where 


d*k gas) 1 5 
= a 6 (x — y)d[(x — y) 
Gre(X — y) \s pee? ke = oH (x° — y)6[(x — y)*] 


Finally, we find 


S = exp E | ats Als) J0 | exp |- ; {| Oa yy ()Grelx — nj 
(4-62) 
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As in Eq. (4-16), we may write the normal form of this expression: 


SS: exp |= [as Ants) i : exp (; [ats d*y j,Ax) {[ Ala (x), Ala Cy) 


i A(x° a: y’) [ Af.(x), Aj,(y)]} i0)) 


The quantity within the brackets in the second exponential is a c number, and 
therefore is equal to its vacuum expectation value (in the state |0 in>). We 
omit for simplicity the subscripts “in” and write 


CO] [AO (x), A *(y)] — O(2e° — y°)[AX(x), A’(y)] [07 
— <O| A*(y)A"(x) + O(x° — y°)[A4(x), A°()] |0> 
_ <0| O(x° — y°) A*(x) A*(y) + A(y® — x°)A"(y)A*(x) |0> 
— <0| TA“(x)A”(y) |0> 
dtk e7~ik &-»y) 
ig" \as les 


To summarize, the normal form of the S matrix reads 


S = :exp E | ax A,,(x) 9 | 


x exp - 5 | | atx d*y (0 in| TAig(X)-J(%) Ain() J) [0 in) | (4-63) 


We emphasize again that this result differs only by an unobservable phase from 
the expression given in Eq. (4-16). 


4-2 WICK’S THEOREM 


This section is devoted to a study of the algebraic manipulations that allow 
one to reorder a T product of operators, such as those appearing in the interaction 
representation, into a normal form, more convenient for the actual computation 
of matrix elements in the Fock space. In the sequel, all the fields are free fields. 


4-2-1 Bose Fields 


We have just derived, for a free massless vector field, the identity 


T exp | -: [ax Acs) 109 | =r. exp E [ats A(x) 00 | 


xe |-5 i d*x d*y 0] TA(x) J) AQ) JO) o> | (4-64) 
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This result depends neither on the vector nature nor on the masslessness of the 
field. The same formula holds for any boson field. Therefore, we shall not specify 
whether we deal with scalar, or vector, tensor, etc., fields, and omit hereafter 
all Lorentz indices. Equation (4-64) may be considered as the generating function 
of a set of identities, known as the Wick theorem. We expand the exponentials 
and identify the coefficient of j(x,)---j(x,), symmetrized over x, ‘es pgeu(Giace 
ultimately this product and its coefficient is integrated over x, ---x,). We obtain 


T[A(x1)A(x2)] = : A(x1)A(x2): + (0| TA(x1)A(x2)|0> 
T[A(x1)A(x2)A(x3)] = : A(x) A(x2)A(x3): + 2 A(X): <0| TA(x2)A(x3) |0> 
+ : A(x2): <0] TA(x1)A(x3) |O> 
+ : A(x3): <0] TA(x1)A(x2) |O> 
DLAC): AG i= 5 AG) A(x,): 


+ Ys Ale): Ala) AG Ale): <0] TAC) Ale) OD + 
k<l 
a y : A(oes)++* Aly) +++ Ata) +++ Ale): 


ky<ky< +s <k2, 
x py <O| TLA(Xkr,) (Xen) ] [OD --+ O| TEA (cheray-1) ACen) ] [OD + °° 
(4-65) 


In these formulas, the caret above a term means that it is to be omitted from 
the product, and the sum )’, runs over all permutations that lead to different 
expressions. In words, Eq. (4-65) expresses the T product as the sum of all possible 
normal products where some pairs of fields have been omitted and replaced by 
their contraction, 1.e., the vacuum expectation value of their T product. 

It is a good exercise to prove directly the identities (4-65). This is most easily 
done by induction. We also let readers convince themselves that they may be 
generalized to the T product of distinct fields TA,(x,)-*:An(x,). They may be 
further extended to an expression of the form 


with the restriction that only contractions between distinct normal ordered pro- 
ducts occur. Identities similar to (4-65) may be written in order to express the 
ordinary product A;(x,)°:: A,(x,) in terms of normal products. The only modifica- 
tion is that now contractions represent the vacuum expectation value of the 
ordinary product 


Ag(xr)*** An(n) =: Auloer)*** Ag (6p) A (k) 1 AGE: 


x {<0 Ax, (X%,) Ax, (Xx,) |o> er <0| Ax, ,-1%kay—1) Ag,,(k:,) |0> 


+ permutations} 
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Finally, analogous identities connect T products and ordinary products. In that 
case, the contraction stands for a retarded Green function ; for instance, 
TA(x)A(y) = A(x) A(y) + O(y° — x°)[A(y), ACD] 
From Eq. (4-65), it follows that 
<0| TLA(x1)*** A(x2p-1)] [0 = 0 


(4-66) 
<0| TA(x1)*** A(X2p) |0> = ¥; <0] TA(xp,) A(xP,) [0° 
x (O| TA(xp,,_,)A(xp,,) |0? 


The symmetric form of the right-hand side has been called “haffnian” by Caianiello. 
Using (4-66), Eq. (4-65) may be rewritten in the weaker form 


[n/2] 
TA(x1)-*A(im)= SY Aer) AG) Ate, Aloe): 
p=0 k\<°°+ <k2, 
x €0| TLA(Q%,)°** AQx,,)] |O> (4-67) 


4-2-2 Fermi Fields 


When Fermi fields are involved the only modifications to be made in Eq. (4-65) 
are the signs. As in the Bose case, we shall derive an identity relating the 
generating function of the T products to the one of normal products. The deriva- 
tion for boson fields was based on the identity (4-15). In order to deal only with 
commutators, when fermion fields are present, we introduce anticommuting 
sources 7 and y for w and W. These quantities anticommute among themselves as 
well as with y and w. We insist on their purely mathematical role (they are 
elements of a Grassmann—i.e., anticommuting —algebra). They enable us to write 
the identity 


[i(x)W(), WO)n(y)] = HO) {WOd, VO} ny) (4-68) 


All other commutators of my, Wn vanish. We now introduce the fictitious inter- 
action lagrangian 


L(x) = n(x) h(x) + W(x)n(x) (4-69) 
and the corresponding S matrix 


SHaexpi | d*x [q(x) W(x) + W(x) n(x)] (4-70) 


Since the commutator [;(x), Y:(y)] commutes with Y;(z), we may use again 
our favorite identity (4-15) and get 


T expi {as [i Wx) + Wx)n0)] = exp | a*<ticowen te Boone} 


ae | 5 | [atx dy O° — y°) [ADW(x) + Hen, alyh(y) + Bonin} 
(4-71) 
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In the same way, we may decompose fy + wn into a sum of annihilation, positive 
frequency part (superscript +), and creation part (superscript —), and write 


emp 4 [as [200W(x) + WOdn0d] = exp i [ar LIK”) + WO "()n00)] 
x Expl [ax LHW) + VOX) 


x exp . {es d*y [H(W'°%) + WO), HOW (y) + mon 


(4-72) 
As above, we replace the c number: 


[as d*y {0x9 — y) LHW) + WOOD, AYO) + VOnW)] 


— [HW (x) + POX), AWOL) + WO) n(y)]} 


by its vacuum expectation value, that is, 


<0| {fer d*y {A(x° — y)[Fi(x), Lily] + Ly) Lr(x)} |O> 


=O) | | d*x d*y TL L(x) £r(y)] |0> 
Owing to the anticommutation of the y, this is nothing but 
{| d*x d*y [Malx)<O| O(x° — y)Walx)W p(y) — O(y° — x°)Wely) Wax) |0> na(y) 


+ nalx)<O| O(x° — yyalx)Wo(y) — O(y? — x° WoW Wal2 0? Ho(0)] 


Notice that the other terms in WwW or WW have disappeared in the integration. 
We recognize the definition of the T product of Fermi fields [see Eqs. (3-173) 
or (4-55)]: 


| | d*x d*y [Halx)<O| Tal Woly) |0> np(y) + nalx) <0] THa(Wa(y) O>77())] 


Both terms give the same result, after an interchange of the variables x and y 
under the integration sign. Finally, we obtain 


exp i {as [9(x) p(x) + W(x)n(x)] = uy [ats [H(x)W(x) + W(x)n(x)]: 


x exp | - [Jer d*y A(x) <O| TH(x) Wy) o>n | (4-73) 


The only difference with the corresponding formula for Bose fields (4-64) lies 
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in the absence of the factor 4 in front of the last exponent. This is due to the 
fermion charge. Had we started with a charged, nonhermitian Bose field A, we 
would have written £;(x) = [A*(x)j(x) + j*(x)A(x)] and obtained two equal con- 
tributions in the exponent <O| Jf d*x d*y TL Pix) Fi(y)]|0>, and therefore no 
factor 4 either. The explicit relations between time-ordered and normal products 
follow when expanding (4-73) and taking into account the anticommutation of 
the n, 7. For example, to second order in the fields, we get 


3 [asx d*y TACO) + Won] LHW) + Wyn) 


=% {asx d*y :[HlW(x) + Woon] LHWO) + VON]: 


a | | d*x d*y A(x) <0] TL) W(y)] |0> 0) 
and after identification 
TY(x)h(y) = 2 WOx)W(y): 
TH(xW(y) = : WOW(y): 
WOODY) = : WO(y): + <0] THODH(y) 0 
THY) = : WEDY(y): + CO] THOE)W(y) [0D 
We note that the two last expressions are equivalent, since 


T[WAxWA(y)] wi TWAAvWAx)] 


(€ and yn are spinor indices). For the sake of compactness, it is convenient to 
include in the argument of y the space-time coordinate, the Dirac index, and a 
discrete index which distinguishes w,(x) from w,(x). With this convention, the 
general expression of Wick’s theorem for Fermi fields reads 


(4-74) 


{n/2] 
TWO) Wed] = YY on: WO) Wk) Way) Hon: 


x <O| Tike, W(kp,)|0>--- <0] TWike,,_,)Wlke,,)|0> (4-75) 


One sums over all distinct contractions, and op = +1 is the signature of the 
permutation which transforms {1,...,} into {1,...5k},...ch2p,--.5, kpy..-skp,,}- 
Let us illustrate this rule on the T product of four fields: 


TL DY2W3)W4} = : VOY 2)Y3)¥4): + : WW): <0] THB)W(4)|0> 
—  W(L)W(3): <0] TH(2)W(4)|O> + : W(LDY(4) : <O| TY(2)W(3) |0> 

+ : W(2)W(3): 0] THA)W(4)O> — : W(2)W(4): <0] TWA) WB) |O> 

+ : W(3)W(4): 0] TH(L)Y(2)|0> + <0] TH(1)H(2) |0> <0] TW(3)W(4) |O> 

— O| TY(1)Y(3) |0> <0] TH(2)W(4)|O> + <O| TW(A)W(4) 0 <O] TY(2)W(3) 0D 
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Equation (4-75) may be specialized to the vacuum expectation value: 
<0| TW(1)--- W(2n — 1)|0> = 0 
| TW) W2n|0>= SY — ep KO] TH(P,)W(P2)|0--- 


distinct terms 


x <0] TW(Pan—1)(P2n)|0> (4-76) 


5 Serre»... 


permutations P 2"n! 
x <0] TW(Pan—1)W(Pon) |O> 


The expression on the right-hand side of (4-76) is a pfaffian, i.e., the square root 
of the determinant of the antisymmetric 2n x 2n matrix with generic element 


<0| TH) W()|O>. 


. 4-2-3 General Case 


In the general case, where both fermion and boson fields are involved, the 
expression of the Wick theorem may be easily derived. Distinct fermion fields 
anticommute, distinct Bose fields commute, and a boson field is assumed to 
commute with a fermion field (the latter point is just a matter of convention). 
We then decide to use as above a compact notation: wy denotes either a boson 
or a fermion field, with a discrete index to distinguish its nature. It must then 
be clear to the reader that the formula (4-75) is still valid, provided we under- 
stand that the sign op is the signature of the permutation of the Fermi fields 
only. This result follows directly from the generating function (4-73), which has 
the same form whatever the fields are. 

The Wick theorem discussed in this section has a close relationship to the 
computation of the integral of a polynomial of several variables, with a gaussian 
weight. This analogy is not fortuitous, as we shall see later in the discussion 
of functional integrals. 


4-3 QUANTIZED DIRAC FIELD INTERACTING WITH 
A CLASSICAL POTENTIAL 


4-3-1 General Formalism 


After this algebraic interlude we shall now discuss another case of a quadratic 
lagrangian, namely, the electron-positron field in the presence of a classical electro- 
magnetic field. Electron-positron pairs will possibly be created in this external 
field. This is the physical counterpart of the problem discussed in Sec. 4-1, 
whereas the creation or annihilation of single electrons by the anticommuting 
source of the last section was its mathematical counterpart. 

Besides interesting phenomena such as pair creation—which has clearly no 
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classical equivalent—this study should also give us some insight into the role of 
the Dirac equation for the quantized field wy: 


[id — eA(x) — m] W(x) = 0 (4-77) 

Here A“(x) is a given c number external field. This equation corresponds to the 
interaction lagrangian 

Ly(x) = — Hx) = — ep(x)y*W(x)A,(x) (4-78) 


All the steps leading to the interaction representation and to the S matrix in 
Sec. 4-1-4 may be repeated here. This results in the expression 


S = Texp E ie [as Diats"Vibod Aut) (4-79) 


But now, it is not trivial to put this expression into a normal form. This is 
because the interaction lagrangian is quadratic in the fields, rather than linear, 
and therefore the commutator of two currents wy“w is not a c number. Notice 
also that we do not know the general solution of the Dirac equation (4-77) in 
an arbitrary field A,,. 

In spite of these difficulties, we have all the machinery of the Wicks theorem 
at our disposal, and we can find a formal solution of the problem. Let us con- 
centrate on the probability amplitude of emitting no pair: 


So(A) = <0 in| S|0 in> 


(— ie)" 


= > re [ax + dXq SOl TL (x1) A(x W1) > Wen) ACen) W%n)] [OD 
(4-80) 


where “in” subscripts have been omitted. According to the Wick theorem, each 
term in (4-80) is a sum of products of contractions of the form 


<0] TAC) Wa) U(x) |0> 


For x, and x; given, let us introduce the 4 x 4 matrix 
C (On, Xx; 1, X1) = —ie »y <0| T[AazalXk) Wal Xx)Wo,(X1) | |0> (4-81) 


In terms of C, So(A) reads 


pous 


(4-82) 


It is convenient to consider the discrete indices «; and the continuous variables 
x; on the same footing and to group them in a bracket notation |x, «>. The 
space of these vectors |x, «> is nothing but the space of classical spinors. We 
introduce the matrix T of generic element 


<x, a| Py, B> = C(x, a; y, B) (4-83) 
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Using this matrix notation, we may write 


So(A) = Det (I —T) = exp[Tr In(J — T)] (4-84) 


These expressions are indeed true for any finite-dimensional matrix I: 


det (I — T) = exp[tr In(J —T)] 


— 1)? 
Prue > Waalns — Vasl se) + °° * 
a a,b 
(=4)! 
ies 2 (4-85) 
. ay ta, P 


permutations 


as a result of the Cayley-Hamilton formula for a determinant. E 

This expansion stops at the dth order for a finite d-dimensional matrix. The term of order d 
is, of course, (— 1)‘ det I. This formula expresses the determinant as a sum of traces of antisym- 
metrized tensor products. There exists a similar formula for the inverse determinant 


det™ "(I — T) = exp[—tr In(I - Ty] 
= jl 6 ay Toa + 3 »y (Foals + Vasloa) + °°” 
a a,b 


1 
oP? ar ian > Vay, on Tap, stone s* (4-86) 


that is, as a sum of traces of symmetrized tensor products. This Jatter expression would be relevant 
for the problem of quantized boson fields coupled to an external field. 


These expressions extend to the case of infinite matrices in the framework 
of the Fredholm theory, under proper assumptions on the behavior of I’. We 
use capital letters for Det and Tr to recall that these operations imply integrations 
over continuous variables. The use of normal products in (4-78) would amount 
to the assumption that I’,, = 0, and therefore to dropping the term ye Ty, in (4-85). 

As in the last section of Chap. 2, it is convenient to introduce operators 
X,,and P,, defined on the states |x, «>: 


DG epee 06a p0n0e> 
ieee) 
<x, a Pulp? = iG Oe a 9? 


They enjoy the canonical commutation relation 
[Xp Py] = —iguy (4-87) 
and eigenvectors of P,, denoted |p>, are such that 
ach 
(21)? 


Using these matrix notations, together with the expression (3-174) of the Dirac 


Grieg —"<x|p)* = 
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propagator, we find 


1 
mere 


: | 
P—mt+ie 
1 
=) Det \? = eA(x) Pate ig] oe 


1 
= exp jn In G — m) | 


The last expression exhibits the Feynman propagator in the external field A,(x). 


So(A) = Det [ — eA(x) 
(4-88) 


It is a good exercise to check gauge invariance. Under a gauge transformation 
A,(x) > Ay(x) + 6,0(x) 
P — eA(x) — m— P — eA(x) — eG D(x) —m 
The relations (4-87) imply that 
e~ OX) TP — e A(x) — m] e*) = P — A(x) — eG (x) — m 


Therefore a gauge transformation amounts to a unitary transformation in the space of the classical 
spinors. It does not affect the determinants and leaves So(A) invariant. 


Let us examine the consequences of the unitarity of the S$ matrix. It is con- 
venient to introduce a one-body scattering operator ¥(A) defined by 


T (A) = eA(x) + eA(x) 7 (A) (4-89) 


P—m+ie 
We assume the potential A,(x) to be real; for every operator B on the space 
of vectors |x, «>, we set 

B= y° Bry 
where the hermitian conjugation acts on the indices x and x. This operation 
changes the sign of is in 7 (A): 


F(A) = — F | 
T (A) = eA(x) + eA(x) Pm F(A) = eA(x) + F(A) (ae e A(x) 
(4-90) 
Thus 1 
eA(x) = F(A) ~ F(A) Pon eA(x) 
The insertion of this expression into (4-89) leads to 
T = aed Vee 1 1 am 
7 (A) = eA(x) + E (A) — 7 (A) Pom ne cA Pyare (A) 
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= FT 1 TFT Ti i ; 
= eA(x) + F(A) is il (A) — F(A) iar [7 (A) — eA(x)] 
= 1 
= 0) pry? TF (A) + eA(x) + —" 
ar 1 T 
= F()| 5 ml (A+ F 


TJ (A) — F(A)= F(A) 


1 1 
—— sae? (A) 


P—m+ie P—m-—ie 


TF 1 1 FF 
1 A eee ae oe 


= F(A) E (P + m)d(P? — m)] T (A) (4-91) 


The operator 2(P + m)d(P? — m?) may be decomposed into a sum of projectors 
Over positive and negative energy states 


2n(P + m)d(P? — m?) = p*) + pO 


4-9 
p\>(P) = 2n(P + m)6(+ P°)5(P? — m?) “ia 


These operators which project on the mass shell will emerge naturally in the 
computation of physical quantities, such as |So(A)|?. 

If we return to the original expression (4-82) of So(A), every term in the 
right-hand side may be decomposed into cycles: 


C(a1, X15 2, X2)C (a2, X25 3, X3) °° Cote, X35 1, X1) 


Had we used the retarded propagator instead of the Feynman one, all these 
cyclic terms would vanish, since the retarded Green function has its support 
inside the future cone and thus a cyclic term has an empty support. As noted in 
Chap. 2, the retarded propagator has the form 


P+m 7 1 
(P+ ie)? —m* P—m+i¢ 


where ¢, is an infinitesimal positive time-like four-vector. We conclude that 
1 
Det | I — ————— = 1 4-93 
et| cA) ga | (4-93) 


But 


Poni = (P +m) E (= Z =) — in e(P°)d(P2 — n’)| 
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where PP means principal part. Therefore, 


and 
1 = Det| 000) ae eee ae aa 
= Det _ e A(x) ron = ieACxp™ | 


1 
= 40) aa aes i — eA(x) poece |Z 


Fomve ew f 


[pe [I — iF (A)p} (4-94) 


Si 
—— 
Lael 
| 
has} 
> 
— 
i 


= | Diet sl — eA(x) Pon 


From these expressions and from (4-91), it follows that 
[So(A)]~* = Det [I — i7(A)p] 
|So(A)|"? = Det {[I — iF (A)p'][T + iF (A) ]} 
= Det {I + i[7(A)— F(A] pO + F(A)pF (Alp? (4-95) 
= Det [I — F(A)pF(A)p™ ] 
= exp {Tr In [I — F(A)pPF(A)p}} 
We write 


|So(A)|? = exp |- [atx v6) | (4-96) 

with 
w(x) = tr <x|In {I — Z(A)p F(A)p] |x) (4-97) 
(where the trace now refers only to the Dirac indices), and we interpret w(x) as a 


probability density for pair creation. If we divide the four-dimensional space into 
small cells of volume Ax; about the points x;, we may approximate 


|S0(4)|? ~ [] [1 — Axi] 


which confirms the previous interpretation. Each cell gives an independent con- 
tribution w(x;) to the emission probability. This may also be seen by an explicit 


INTERACTION WITH AN EXTERNAL FIELD 191 


computation of the probability of emission of one, two, etc., pairs. Because of 
the presence of the projectors p‘*’, the probability w(x) d*x is expressed in terms 
of on-shell matrix elements of 7 between a particle and a hole state. This is 
made clearer by the introduction of a reduced operator t in the following way. 
Using the normalized positive and negative energy solutions u'®(p), v‘(p), intro- 
duced in Chap. 2, we define the matrix r on the mass shell by 


<pa|t|p’b> = 22> rs HUD. a|7 |p’, B>vP'(—p’) (4-98) 
2B Op Wp’ 


with «w, = (p* + m’)'*. From the expressions of the projectors 


») u(p)ii(p) = B aa for p° >0 
a=1,2 
“ p+m 
9 ( — n\n (npn) = — 0 
we b(—pe"(— p) = for p® <0 
we then find 
Ce [atx w(x) = [ep tr In (1 + tt’) (4-99) 


This exhibits clearly the positivity of Wi... We shall now compute explicitly this 
probability in two special instances. 


4-3-2 Emission Rate to Lowest Order 


It is possible to get an expansion of W in powers of « = e?/4n by expanding 
both the logarithm and 7 (A) in Eq. (4-97). To lowest order, we find 


WY) = —Tr(eAp™ eAp™) (4-100) 
We set 


1 1 i(p— p')>x 
<p| A(X) |p’? = (Qn? a,(p — p') = (On) {ats A(xee 2" (4a) 


Thus 


wir = [ata sta - Pi ~ pa) [5B S22 [Ube + maton - md(—4)] 


(4-102) 
But if g = pi + p2, we have 


4 tr [(p1 + m)a(q)(P2 — m)a(—4)] 


2 
= E = a(q)* a(—q) + p1°a(q)p2*a(— 4) + pr-a(—a)ps- a) 
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We then compute the integrals [q” = (p1 + p2)? = 4m]: 


a | 
-|¢ pide 6*(q — Pp, — p2) = 4a | aie 5 OWT — 2) 
2m 


@M,; W2 
2 \1/2 4 2 
=2n (i ail ) mel ) (4-103a) 
q q 
dp, : 
oe ( a fi tq = pi 
WM, (0) 


i 

a 
Indeed, IJ,,, has the general form I,,, = [1g,. + Ioq,q, and successive contractions 
with 9,qy and g, lead to (4-103b). This yields for W™): 


d*q 4m? \1/2 4m? 
WY) = 2 (<4 (1 - ) 6(q°)6{ 1 — 2q? + 4m? 


x {q-a(q)q° a(—q) — q’a(q)- a(—q)} (4-104) 


7) + g*(q? — sn’) | (4-103b) 


This result will be made more transparent if we introduce the Fourier transform 
of the electromagnetic field 


F,,,(g) =e i[quay(q) = qva,(q) | 
4 F,,(q)F""(—q) = |B(q)|? — |E(q)|? 
= q’a(q)* a(— 4) — q-.a(q)q° a(—q) 


Finally, since the integrand in (4-104) is even, we may forget the restriction 


GQ. > 2m 
PAN SYP2 9 
ue i ( 2 -) (4-105) 


This is a nontrivial result of field theory, but it could have been pee through 
a careful interpretation of the hole theory. Observe that for g? > 0, there exists 
a frame where q=0 and thus B(q) = —iq x a(q) vanishes; this implies that 
|E(q)|? — |B(q)|? = 9, els is, pair creation is an electric effect. However, electro- 
magnetic modes with q* > 4m” are not that easy to generate by the current 
technology! 

We quote also the corresponding rate for the pair creation of spinless charged 
bosons, to lowest order 


wh) — - [ata 8(q* — 4m*)[|E(q)|? — | B(q)|7] (: = 


a 4 2 2 ee 
Weotons = at q 8(q° — 4m*)[|E(q)|* — |Biq)|?] (: a ) (4-106) 
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4-3-3 Pair Creation in a Constant Uniform Electric Field 


In the case of a constant uniform field, we are able to derive an exact, non- 
perturbative result. We return to Eq. (4-88) and write 


In'Se(4yeTr In \ — eA(X) — m + ie] (4-107) 


1 
P—m+ie 
Since the trace of an operator is invariant under transposition and since the 
charge conjugation matrix C satisfies 


Cys, a —Yh 
we may write 


InuSig(A)= Tr In IP — eA(X) +m -— ie] ot (4-108) 
and the sum of (4-107) and (4-108) reads 
2 IniSeG= Tein (a — eA(X)]? — m* + ie} oars (4-109) 
The useful identity 
ine i dS (eisto+ie) _ gistatte (4-110) 
b o Ss 
enables us to write for the desired probability 


w(x) = Re \, & e~ istm?—ie) ty (<x| eis(P- eA") nce els?" |x)) 
Oo » 


= Retr [ bad er) (4-111) 
o 8 


<x| (exp ‘8 C — eA(x))? + 5 ewkeoy — on) \x> 


For a constant field, this probability should not depend on x. Moreover, OyyF fe 
commutes with all the other operators, and we may compute its exponential. 
From now on, we assume that the electromagnetic field is purely electric (we 
have seen in the last section that the pair creation is an electric effect) and that 
the electric field is along the z axis. We also choose a gauge such that only 
A(x) = —Et(t = x°) is nonvanishing. Then 


tr e'seewF"/2 — 4 cosh (seE) (4-112) 
and using the commutation relation [ Xo, Po| = —i, we get 
(P — eA)? — m? = P3 — P}. — (P? + eEX°Y 
= Gee 2 (pe = P2 = e? E? X°7) eiPoP?/eE (4-113) 
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Therefore, 


tr | eisl(P—eA)? +0,,F""/2] |x» 


dw da’ ete amt + p?>/eE) — isp? <o| eistPd - e*E? Xen! 


d°p 
= 4 cosh (seE) \@ a 


2eE 7 is(P3 — 6? E? X32) 4 
= Gptis -~—>— cosh (eEs) = dw <a] els 3) lq > (4-114) 
The last integral may be considered as the trace of the evolution operator of an 
harmonic oscillator with a purely imaginary frequency. This results from the 
correspondence Py > P, — Xo > Q, 2iecE + wo, 4+ mo, P3 — e? E? X3 > P?/2mo + 
4mow6Q*. The energy levels of such a system are well known and it follows that 


2 1 
Tr exp] is i + mee Q?}|= > exp|is|n+ = }ao 
2m 2 n=O 2 


i 


ere 4- 
2 SiN SWo/2 ae 
so that 
00 : if 
is(P3 —e?E? X3) eat ele oe 4. 
{- Be Sa : o> = sank (seE) (HIG 


Collecting all the terms, we find 


© q — 
\ S E coth (eEs) — :| Re (ie~ istm*— ie) (4-117) 
0 


Potential energy 


VollelE eEx 


Figure 4-1 Potential energy (solid line) of an electron submitted to the binding potential V(x) 
(dashed line) and to the electric potential eEx. 
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The term 1/s corresponds to the subtraction at e = 0 in (4-111). It must be noted 
that the integral converges at both ends: the infrared convergence at s— oo is 
insured by the m? — ie prescription, while at s +0, Re(ie~*”") = sin sm? ~ sm?, 
the bracket vanishes like s and the integrand is finite. The integral in (4-117) 
‘may be computed by the method of residues. It is first transformed into an 
integral over (— oo, +00); the integration contour is then deformed to encircle 
the negative imaginary axis and to pick up the contributions of the poles of the 
coth function. This leads to the final expression for the probability of creating 
pairs per unit volume and unit time: 


aneae | nnm* 
w= =e = exp(- | (4-118) 


Pair creation in a constant field has never been directly observed, because of 
the smallness of typical |eE| as compared to m?. We recall, for instance, that the 
electric field E,, on a Bohr orbit of an hydrogen atom is of order eE,, ~ m?a°> ~ 
dun 107 damn?. 

The analogous expression for creation of spinless bosons reads 


2 © See tee 2 
sia Yi a exp ( a (4-119) 


bosons — 2n? = leE| 


In these formulas, the essential factor is nonperturbative: exp(—2m?/|eE]). This 
is reminiscent of the quantum phenomenon of tunneling through a potential 
barrier. Indeed, in atomic physics, let us consider an electron bound by the 
potential V(x) (dashed line in Fig. 4-1) and submitted to an additional electric 
potential eEx. The resulting potential is depicted by the solid line in Fig. 4-1. 
For a binding energy — Vo the ionization probability is proportional to 


V o/leE| 4 / Ke 
exp -2 | dx ./2m(Vo — |eE|x = ex0(—3 ee cai) 
0 


If we consider a negative energy electron as trapped in a potential well |Vo| ~ 2m, 
we get a factor exp(—constant m?/|eE|) in qualitative agreement with the exact 
result. 


4-3-4 The Euler-Heisenberg Effective Lagrangian 


A classical electromagnetic field may produce pairs as a consequence of quantum 
effects. This means that the dynamics of a classical electromagnetic field contains 
quantum corrections. Until now, we have studied an absorption effect. Dispersive 
effects exist as well. We summarize these quantum corrections in an effective 
lagrangian 

Less => LZ ae of 


with Ly = —1F,,F” = 4(E? — B?) (4-120) 
6L = bL{(E? — B), (E-B)’] 
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where the correction 6¥ is assumed to be local, at least for a slowly varying 
field, and has been written as a function of the invariants E? — B? and (E: B)?. 
We know that the average properties of polarizability in matter are represented 
by the introduction of the electric induction D and the magnetic field H. Similarly, 
the role of 6Y is to take into account the polarizability of the vacuum, Le., 
the interaction of the electromagnetic field with the vacuum fluctuations of the 
electron-positron field. The amplitude S9(A) will be present in any computation 
of an electromagnetic effect, since. it represents the dynamics of the vacuum 
fluctuations. We expect, therefore, that 6 is given by 


<0|S|0> = So(A) = exp [ [a 500) (4-121) 


The real part of 6Y describes dispersive effects and its imaginary part, the 
absorptive ones. The functional 6Y(A) may have a complicated form, but if we 
have only to deal with low frequencies it is sufficient to consider.a slowly varying 
field, or even a constant field to lowest order. The method of computation of 
the last section will then be relevant. For a pure electric field, we find 


00 1 as 
OL(E) = = \, s E coth (eEs) — | em (4-122) 


while in the general case ~ 
Py'ds _ ont | 3, Cosmi@as)icosi(ebs)) 1 
<a aaieaie a Se a ar ca ST 4. 3 
oe) 82? \, oe” E o sinh (eas) sin(ebs) _s? (ae 


where a* — b* = E? — B*, ab = E-B. The reader may also derive (4-123) from the 
expression for the Dirac propagator given in Sec. 2-5-4 using the relation 


6 i 
—— |d*x6¢ =etr > 
56 A"(x) | ia SS: ig —eA—m 

In contrast with (4-117), the integral in (4-123) has a divergent part propor- 
tional to a* — b?, which will be cured by a renormalization. The idea is to rewrite 


5L =4C(E? — B) + 52, 


where C is infinite, and to modify Yo in (4-120) into (1 — C)Lo; this modification 
is undetectable, since only % + 6¥ may be observed by assumption. The latter 
now reads 


Ix) 


E? — B? E? — B? 
Je) eee OS = at Oa (4-124) 


OL snite 18 obtained from 6 by subtracting the divergent term; there is, of course, 
some arbitrariness in this subtraction. We shall discuss at length this renormaliza- 
tion operation and its arbitrariness in the following chapters. 


The absorptive part of 6% vanishes with all its derivatives at e = 0. Only 
dispersive effects may be computed by a perturbative calculation. For instance, 
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we find, by expanding (4-123) to second order in a, 
2a? 
45m* 


‘The Euler-Heisenberg effective lagrangian (4-123) and (4-124) may be used to 
discuss various nonlinear effects due to the quantum corrections. 


5g = 


[(E? — B*)? + 7(E- B)?] (4-125) 


NOTES 


The physical interpretation of the infrared catastrophe dates back to the work 
of F. Bloch and A. Nordsieck, Phys. Rev., vol. 52, p. 54, 1937. A thorough treatment 
within perturbation theory is given in D. R. Yennie, S. C. Frautschi, and H. Suura, 
Ann. Phys. (New York), vol. 13, p. 379, 1961. The use of coherent states to provide 
a bridge between quantum and classical electromagnetism has been pioneered by 
R. J. Glauber, Phys. Rev., vol. 131, p. 2766, 1963. Further developments can be 
found, for instance, in “Quantum Optics and Electronics,” edited by C. De Witt, 
A. Blandin, and C. Cohen-Tannoudji, Les Houches, 1964, Gordon and Breach, 
New York, 1965. 

The interaction representation is due to F. J. Dyson, Phys. Rev., vol. 75, 
p. 486, 1949, while the original work of G. C. Wick is in Phys. Rev., vol. 80, 
p. 268, 1950. 

The effective lagrangian for the electromagnetic field arises from the work of 
W. Heisenberg and H. Euler, Z. Physik, vol. 98, p. 714, 1936, and is worked out 
in detail by J. Schwinger, Phys. Rev., vol. 82, p. 664, 1951, which is the source for 
Sec. 4-3. See also J. Schwinger, Phys. Rev., vol. 93, p. 615, and vol. 94, p. 1362, 
1954. 


CHAPTER 


FIVE 
ELEMENTARY PROCESSES 


We discuss the relation between measurements of cross sections and the dynamical 
content of field theory provided by Green functions through the reduction 
formalism of Lehmann, Symanzik, and Zimmermann. This is followed by 
elementary applications to electromagnetic processes to lowest order. General 
properties of unitarity and causality are presented and used to introduce partial 
wave expansions and dispersion relations. 


5-1 S MATRIX AND ASYMPTOTIC THEORY 


We have to complete our kinematics before embarking on the perturbative 
evaluation of transition amplitudes. We present in this chapter the general frame- 
work which will enable us to relate these calculations to the physical processes 
of interest. 

In practice a major tool of investigation is particle scattering off various 
targets. On a macroscopic scale, interaction times are extremely small. It is there- 
fore out of the question to follow in detail the time evolution during the elementary 
scattering events. We can only give the following picture. Long before the collision, 
well-separated wave packets evolve independently and freely. As already discussed 
in the special instances of Chap. 4, the set of these incoming states builds up a 
Fock space, the in-space with associated free fields. It is important to note that 
these in-states must represent exactly the individual characteristics of isolated 
particles, such as mass and charge. In other words, self-interaction effects must 


198 


ELEMENTARY PROCESSES 199 


be absorbed in these measurable parameters. The collision process then follows, 
involving scattering, absorption, or creation of new particles, submitted to the 
fundamental conservation laws of energy, momentum, angular momentum, parity, 
charge conjugation, internal symmetries, etc. Long after the collision, free wave 
‘packets separate, representing the outgoing states. They are again described by 
free particle kinematics and by corresponding free fields. According to the 
postulates of quantum mechanics the amplitude 


<b, out | a, in> 


enables us to obtain the probability that an incoming state |a> will evolve in 
time and be measured in the |b> state. 

The out-states can represent the incoming states for a successive process. 
This is the case, for instance, when one prepares secondary beams. Therefore an 
isomorphism must exist among the in- and out-Fock space. Our goal is to relate 
the above transition amplitudes to the actual measurements. In other words, we 
want to establish the relativistic conventions of Fermi’s golden rule and give the 
expression for cross sections. 


5-1-1 Cross Sections 


To take a simple case we first envisage the scattering of two distinct, spinless 
particles. The incoming state is written in terms of the incident wave packets in 
momentum space: 


. a ad? ; 
iin) = [setts oa fi(p1) folp2) |p, p2, in? 1) 


With these wave packets are associated positive energy solutions f(x) of the 
Klein-Gordon equation with corresponding mass: 


es a , 
70)= | rape sO 62) 
and a flux given by 
=e. a 
| d?x f*(x) Go f(x) =|sent | f(p)? (5-3) 


The transition probability to a final state | f, out> is 
W,; =|<f, out|i, in >|? (5-4) 


In the absence of external sources, translation invariance implies that the matrix 
elements vanish if energy and momentum are not conserved. Furthermore, as 
discussed in Chap. 4, the assumption of an isomorphism between out- and in- 
states implies the existence of a unitary operator S, commonly called the S matrix, 
such that 


<f, out |i, in> = <f, in| S |i, in> (5-5) 
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Unitarity of S is necessary to conserve probabilities. Strictly speaking, S applies 

the out-states on in-states according to (5-5). We shall loosely say that we take its 

matrix elements in the in-space and unless necessary we will drop the index 
in.” It is then decomposed as 


S=1 ie (5-6) 


where T contains the information on the interactions. If we idealize the final state 
as a plane wave state we can extract the delta function of energy momentum 
conservation according to 


<f|T \p1, p2> = (22)*6*(Py — ps — pr)<f| TZ |p, Po? (5-7) 


and the reduced operator 7 acts on the energy shell. When we substitute the de- 
composition (5-6) into the matrix element (5-5), the identity only contributes to 
forward scattering and represents a part of the incident wave packet unaffected 
by interactions. In most experiments we are only interested in the deflected part. 
This justifies keeping the sole contribution of 7 in the transition probability 


Wri = | dp, dpo dp dp> f*(p1) f¥(p2) filp1) f2(p2) (22)*6*(p1 + pa — pi — P2) 


x (2)*5*(P¢ — pr — pa)<f|T [prs p2>*< S| TF |p, p2> (5-8) 


If the final state is not a sharply defined eigenstate of momentum the above 
formula has to be slightly generalized in an obvious way. In most cases, one 
prepares the initial particles with almost well-defined momenta with a negligible 
width on the scale of the variation of the matrix elements of 7. In short, f;(p;) 
is peaked around a mean value p; with a width Ap; so that 


Gee pi, p2> ~ OAs [P1, Pa> ~ Cf|F |P1, P2> 


Using this approximation and the integral representation 
(27)*5*(p1 + p2 — pi — p2) = [atx ee Pa Ps ree) 
we find 
Wri = [a | Fulce)|? | fal) PQm)*5*(Py — Bi ~ Ba) |< S| F Pr, B2>? (5-9) 


which can be interpreted as a transition probability per unit time and unit 
volume: 


= | fi)? | fod P(20)45*(P, =pi = P2)| CZ |Pi, B2>|? (Sall0) 


Now fi(x) = e7'?:'* F(x) with F,(x) a slowly varying function of x in such a way 
that 


if *(x) b, F(x) = 2p,| F(x) |? (5-11) 
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To be concrete let us assume that particles of type 1 are the incident ones 
in the laboratory frame on particles of type 2 at rest. The number of particles 
in the target per unit volume is dn2/dV = 2p | f2(x)|?, p? = mz. The incident flux 
is given by velocity |p, |/p? times density 2p°| f\(x)|? equal to 2| ps] | ACO. 
This gives the following interpretation of Eq. (5-10): d W,.;/dV dt = transition 
probability from state |i> to state |f> per unit time and unit volume = target 
density [2m,| f2(x)|?] x incident flux [2|p,| | fi(x)|?] x cross section do: 


do = (2n)*84(P; — Pi ~ pa) —— |<f|F bub (512) 
4m, |p. | 

The cross section is the transition probability per scatterer in the target and per 

unit incident flux. The idealization of a final state with well-defined momentum 

is corrected by integrating over an energy and momentum resolution A. For 

instance, if we consider a final state of n distinct spinless particles (Fig. 5-1), the 

cross section is given by 


1 


do ,.2 = > ee 
A[(p1 *p2)° — mim} |*/? 


| dp3°**dpn+2|<ps,--->Pn+2| TF | pr, P2>|? 
A 


x (27)"0"(p1 + P2 = Ps—-**Par2) (5-13) 


We have expressed the factor m2|p;| in terms of relativistic invariants by 
remarking that in the laboratory frame 


ma|p1 | = ma[(pi)? — mi]? = [(p2-p1)* — mim?]"? 


In this way cross sections have been defined as Lorentz-invariant concepts. We 
remind the reader that dp denotes the Lorentz-invariant measure dp = d*p/2p°(2n)° 
for bosons. These formulas assume, of course, that one-particle states are 
normalized according to 


<p’ | p> = 2p(2n)°5*(p' — p) 


that is, are generated out of the vacuum state by canonical creation operators. 
This normalization gives the interpretation of the flux densities expressed in terms 
of the wave functions f. 

When dealing with massive fermions the factor 2p° will be replaced by p°/m. 
For instance, if the two colliding particles are fermions the factor 4 in front of 
(5-13) is replaced by m mz, and if fermions occur among the final particles the 
phase space element dp is to be understood as (m/p°)[d*p/(2z)*]. However, if 


P3 


\ 


Py 
P2 


oe 


, 


Pn+2 Figure 5-1 Kinematics of a general scattering process. 
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massless fermions are involved in the scattering process, we keep the boson phase 
space factors. 


The reader is urged to study the modifications to (5-13) when some of the particles are identical 
and/or carry some spin. What happens if the initial state is not a pure state but has to be described 
using a polarization density matrix? We have presented here the connection between the invariant 
iransition amplitude 7 and the cross section. Another physically important instance is the computation 
of the decay rate of an unstable composite system. This will be illustrated in the case of positronium 
in Sec. 5-2-3. 


5-1-2 Asymptotic Theory 


The central problem in the study of collisions is the calculation of S-matrix 
elements between on-shell states. Lagrangian field theory provides us with 
algorithms to compute these quantities, as we saw in the previous chapter where 
we investigated the behavior of a quantized field coupled to external sources. 
Local dynamics will enable us to scrutinize in close detail this S operator and to 
express it in terms of more elementary quantities, the fundamental Green functions. 
To keep the formulation as simple as possible we shall develop the argument in 
the framework of a self-interacting scalar field. The generalization to more 
elaborate cases, such as electrodynamics, only involves more kinematics without 
affecting the logic. It will be left to the sequel. 

Let us summarize the situation. A Fock space of states is generated from a 
unique vacuum by a free field denoted by ¢,,(x). This is the stage on which the 
whole dynamical process takes place. In the best of all worlds, physical observables 
would be expressible in terms of this unique free field. In particular this ought 
to be the case for the interacting field g(x). Intuitively we imagine the relation 
between these two fields as follows. In the remote past, ¢,,(x) is a suitable limit 
of (x). This refers, of course, to some definite process under consideration and 
only applies when the elementary participants in a collision are well separated from 
each other. To implement this idea we may assume that the coupling terms in 
the equations of motion are affected by some adiabatic cutoff function equal to 
one at finite times and vanishing smoothly when |t|— oo. All physical quantities 
have then to be understood in the limit when this adiabatic switching is removed. 
The adiabatic hypothesis asserts that under these conditions 


Xo > —00 Q(x) > Z*7p,_ (x) (5-14a) 


Two questions immediately arise. What is the meaning of the factor Z‘/?? In 
what mathematical sense do we expect the limit to hold? First, the factor in (5-14a) 
is due to the fact that fields are naturally normalized by equal-time commutation 
relations, and gj, acting on the vacuum only creates one-particle states while 
g will also generate states with extra pairs (assuming the lagrangian to be even 
in gy). The amplitudes <1] Qj,(x)|0> and <1] (x)|0> have the same functional 
dependence on x dictated by the kinematics. The normalization factor Z!/2 there- 
fore takes into account that the content of the state (x) |0> is not exhausted by 
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the matrix elements <1|¢@(x)|0>, as would be true if g was replaced by gj,. 
Intuitively we thus expect Z‘/* to be a number between zero and one. 

Moreover, the limit involved in Eq. (5-14a) can only be a weak one, ie., valid 
for each matrix element separately. If not, we would conclude that the commutator 
of two fields ~ is equal, up to Z, to the corresponding c number commutator of 
free* fields. In such circumstances canonical quantization would require Z = 1 
and we would soon discover that ¢@ is a free field! This shows that the adiabatic 
condition has to be treated with care. The constant Z can be related to a 
spectral representation of the commutator of two interacting fields, due to 
Lehmann and Kallen, as follows. Consider the vacuum expectation value of this 
commutator. Assume the field hermitian for simplicity. Using translation 
invariance 


<0] [o(»), Gy] |0> = FY [<0] e(O)|a> e7? =—™ Cal pO) |0> — (x y)] (5-15) 


where the sum runs over a complete set of positive energy states «. To compare 
it with the commutator of two free fields of mass m, 


iA(x — y;m) = | a e(q°)6(q? — m?) e748 &-» (5-16) 


we insert in (5-15) the identity 
1= | d*q 5*(q — Pa) 
and obtain 


d*q es 
<O| [ p(x), g(y)] |0> = las p(q)(e7 i &- — gia & ») 


We have introduced the density 
p(q) = (2)? © 54(q — pa)| <0| (0) |a>|? 


It is obviously positive and vanishes when q is not in the forward light cone; 
furthermore, it is invariant under a Lorentz transformation as required by the 
corresponding property of the field @. It can therefore be written 


p(q) = 9(q7)0(q°) — with o(q’) = O if q* <0 


In general this is a positive measure where 6-function singularities might possibly 
occur. We finally obtain a superposition of free commutator contributions with 
positive weights (Kallen-Lehmann representation) 


<0| Lex), e(y)] |0> =i | on a(m’2)A(x — y;m’) (5-17) 
0 


Following the same line of reasoning it is easy to see that a similar result holds for the vacuum 
expectation value of the time-ordered product, with the same spectral function p. 
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We can explicitly separate in (5-17) the contribution of the one-particle state 
by using the asymptotic condition to write 


<0| [e(x), (y)] |0> = IZA(x — y; m) + ‘| ; dm’? o(m’7)A(x — y;m') (5-18) 
Here m stands for the mass of the particles and m, >™m is the multiparticle 
threshold. If the interaction lagrangian does not involve field derivatives, @ will 
be conjugate to @. By taking the time derivative of both sides of Eq. (5-18) and 


identifying the coefficient of 6°(x — y), we find that canonical quantization implies 


1=Z+ | _ dm’? o(m’?) (5-19) 


Positivity of o implies correctly that 
= 71 (5-20) 


The value 1 is excluded if some matrix element of ¢ different from <1| @(x)|0> 
and <0| p(x) |1> does not vanish. Clearly Z = 1 requires ~ = Qj,. 


From the Kallen-Lehmann representation it follows easily that the asymptotic limit cannot be 
understood in the strong sense. We can also check that as | x° — y°| + x the dominant contribution 
arises from the one-particle contribution, the remaining terms being damped by strongly oscillating 
factors. By the same token (5-19) puts bounds on the density o(m*). Many-particle states will give 
to o(m”) a support extending to infinity and there will be a strong tendency to violate the positivity 
of Z. 


In a manner analogous to the asymptotic limit (5-14a) in the remote past, it is 
expected that we also have 


Xo > +00 Q(x) > Zh u(x) (5-14b) 


where @ ,, 1s again a free field with the same mass m as g,, and the same 
constant Z'/*. Uniqueness of the vacuum implies |0, in> = |0, out> = |0>(a possible 
relative phase is conventionally set equal to one). Moreover, we assume that one- 
particle states are stable. Under these conditions |1,in>=|1,out>. Since 
<0| p(x) |1> has the same dependence on x as the corresponding matrix elements 
Of Gin OT Poy, the normalization of which is fixed by their free field character, 
we necessarily have 


CO] p(x) |1> = 2"? Ol gin(X)|1> = Z"/? 0] @our() [DD (S221) 


The S matrix induces the isomorphism between in- and out-states. From 
Eq. (5-5), it follows that 


Pin(x) = S@out(X)S~ : 
Ein == S |i out) (5-22) 
<f in| S |i, in> = <f, out| S |i, out> 
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Moreover, 
<0|S|0> = <0|0> = 1 
ys |i > = <1 |1> 


where indices have been omitted from the one-particle states and we have only 
translated the stability of these states. 


The unitary $ matrix has to commute with Poincaré transformations to 
implement the covariance of the theory 


U(a, A)SU~(a, A) = S (5-24) 


(5-23) 


One can also express deeper locality properties of the S matrix if, following Bogoliubov and Shirkov, 
one allows for space-time dependence of the coupling constants in the lagrangian. We refer the reader 
to their textbook for a systematic use of this approach. 


5-1-3 Reduction Formulas 


Here we shall use the asymptotic conditions (5-14a, b) to relate S-matrix elements 
to the general Green functions of the interacting fields. Consider the transition 
amplitude <p,,...,out|qi,...,in>, where for simplicity we have omitted the 
smearing test functions necessary for normalization. We shall reduce this element 
by extracting one by one the in- or out-creation operators which allow us to build 
the initial or final state. By definition 


Qa..-, OUL|qi,.2:,40) = <pi,---,out| af,(q:)| q2,--.,in> 
aie | 
= jax e094 p,,...,0ut | Pin(X)|G25--+,in> 
ig 


The last integral is performed at an arbitrary time ¢. Let us therefore choose a 
large negative value to enable us to substitute Z~‘/*@ for @in: 


Gipe, OWE) gy... : 5 in) 


: . 1 ; 
=wlineZ [ex eH —"OoKp1,..., out | p(x) | ga,---»in> 
t 


boo 


Now for an arbitrary integral 


eal) 
( lim — lim jax W(x, t) = lim [ad ax W(x, t) 
t~+o t+— ©, tpt t;%—O Vt; t 


Using the asymptotic condition for arbitrarily large positive times we find 


<pi,..-,out|qi,...,in> = <pi,..., out] abur(q1)|q2,---, in? 


ee ae | atxaoter™ BoCD1, ..., out | o(x)| q2,---,in?] (5-25) 
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where the integral extends over all space-time. The first term on the right-hand 
side of Eq. (5-25) is a sum of disconnected terms (corresponding to situations 
where at least one particle is not affected by the collision process): 


<p1,...,out|al,.(q1)| G2,-»-,in> = ). 2pe(2n)°d7(p, — 41) 
: i 


A nee ay ee 


We have assumed here that all particles belong to the same species. The necessary 
changes when this is not the case are obvious. Observe that disconnected terms 
disappear when none of the initial and final momenta coincide. 

Let us now take a closer look at the second term on the right-hand side 
of Eq. (5-25). Since this is understood as a kernel to be tested with wave packets, 
partial integrations over space variables are legitimate. This is, of course, not the 
case for time variables, otherwise all the calculations would collapse. With this in 
mind, denoting by m the mass of the particles and recalling that gj = m’, 


[atx doe '" arn <B, out | p(x)| «, in>] 
- [ass {{(—A + m?) e-'*"*] ¢B,out | p(x) |, in>+ e7 '""*23<B, out! e(x)| a, in >} 


= Ge e~'%"*(T] + m?)<B, out | o(x)| «, in> 


The first stage of the reduction formula takes the following form: 


Piss ss Pas OOO gee =, 1, 101 


= 2p ¢ (27) Pop. — 41)<P1, ++ +5 Pks+++>Pn, OUt| qo,...5 41, in> 


1 


+ iZ [ats ee (CJ ai fa Vp. -+5Dny out | ~(x)| qa,. -+5 QI, in» (5-26) 


The same steps can now be repeated again and again. To be specific, we shall do 
this for a particle in the final state. As far as the disconnected terms are concerned, 
nothing new emerges and we shall omit to write the corresponding contributions. 
This is, however, not true for the second term, for which we may write 


<Pi,-.-,0UL| p(x1)| q2,---,in> = €P2,---, Ut | Gour(D1) (x1) | g2,.-., in 
= lim iZ; ve [ancrn’, 
ree yi 

x €P2,.--,0Uut| P(y1)—(x1)| go,...,in> 


We would like to replace the last integral by a four-dimensional one as we did 
before. Clearly some trick is needed in order for the operator a,,(p,) coming 
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from the lower bound of integration to operate directly on the state | q2,...,in). 
The time ordering of the fields 


Tee. oe) = aa Be 


P(X1)p(y1) x yf 


has just the right property of setting the operators in the convenient order relevant 
for the two limits. Without affecting the previous expression we can therefore 
substitute the T product and proceed as in the first step with the result that 


<P1,---,0ut| p(x1)|qo,...,in> = <P2,-.-, out | p(x1)ain(p1)| G2,.-.,in> 
7 V2 [ety 2’ (CL ,, + m7) <po,..., out | Te(y1) (x1) | qa,--.,in> 


On the right-hand side the first term will yield disconnected terms. 
Once two particles have been reduced the answer looks like 


Cie, OWE | Qrmeesreryit = piers diay |S gage: agit > 
= disconnected terms 


Ss GZ ee [etn Beye 2! 70a) ae me, 4 m?) 
x <p2,.--,out| Te(y1) (x1) | q2,...,in> (5-27) 


Disconnected terms obtained so far involve one or two 6° functions. The same 
reasoning can now be carried further until all incoming and outgoing particles 
have been reduced 


Gre tape Out | qi, .--, Gite — <P i,-.-; Pest |S | Qi,--.57, Out 
= disconnected terms 
n I 
ON (A a ae jay ... d°x, exp (: > Pet ye — >, a%) 
1 r 


x (Dy, + m?)-+-(Dx, + m) <0| Te(y1)-- e(x) [0 (5-28) 


The remarkable feature of these expressions obtained by Lehmann, Symanzik, 
and Zimmermann is the relation they provide between the on-shell transition 
amplitudes and the general Green functions of the interacting theory. The latter 
are precisely the vacuum expectation values of time-ordered field products. 
Relation (5-28) implies that in momentum space the Green functions have poles 
in the variables p?, where p; is conjugate to x;. Up to a normalization constant, 
the S-matrix element is nothing but the residue of this multiple pole. 

We also notice a remarkable symmetry between incoming (q1, ...) and outgoing 
(p;,...)momenta. It is convenient to replace the outgoing set (p;,...) by an incoming 
one (—p1,...) with negative energy component. In this way all momenta are on 
the same footing. 
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5-1-4 Generating Functional 


It is possible to obtain a compact operator form for the set of relations (5-28). 
To simplify notations let us absorb a factor Z~'/? in @ in such a way that 


within this subsection the normalization of g is taken as lim @ = @ou. We shall 
t+ £00 in 


also use a unique symbol for creation and annihilation operators, in agreement 
with a previous remark, by setting for an arbitrary four-vector k on the mass 
shell k? = m?: 


out 


| din (k) ko >0 


at (k) = (5-29) 
in ( ke (—k) k° <= 0 
out 
in such a way that 
ais, (k) = ie(k®) [ex ea, Pin (x) (5-30) 
out out 


Let B,,...,B, be local Bose fields. Equation (5-28) is equivalent in operator form 
to the relation 


[ST[B1(x1)-** Bu(%n)], @in(k)] = —ie(k°)S ay eX (DY, + m’) 
. x TLo(y)B(x1)°**Br(xn)] (5-31) 


The proof Hoon easily by taking in-state matrix elements, using a®S = Sagi, 
and expressing a* through (5-30) in terms of the field. The desired relation is then 
obtained as above by replacing boundary terms by an integral over the time 
derivative. Iteration yields the more general commutator 


le ~ [ST[Bi(x1) ie Baas ais(k1)], tees ain(ky) | 
= (—i)Pe(k?) + e(ky) ay Sede, OXP ( ke), +7 )** (Ge ie) 


x ST[o(y1)*** Pp)B1(1)*** Br(Xn)] (5-32) 
This could be applied to the case of several types of particles and fields. 


To understand the use of (5-32) as a mean of generating S itself, let us remind ourselves of the 
construction of Fock space as a product of Hilbert spaces, each one corresponding to a definite 
mode (indexed here by the momentum k). For a given oscillator, with fundamental operators such 
that [a,at]=1 and a vacuum state |0>, the normalized excited states are \n> = an a 
The projector |0) <0| has the following representation: 


|0><0| =: e722: (5-33) 
This is indeed a hermitian operator leaving the vacuum state invariant and such that 
o(- a a ~ (~ a'y'a opt 
ena, t a’, at Are n= Ge 
[rene at] = ST Lah at] = YO at sere 


t 
or ea: gt = 0 
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Consequently, : e~ 4": [n> = dyo |n>, proving (5-33). 
Now let Q stand for an operator sufficiently regular to give a meaning to the subsequent 
operations. We expand it in terms of its matrix elements as 


0 
oe |m><m| Q |n><n| = ee <0| a"Qa'" |0> 
ED GAED Copue : : 
er es n! So — (0 ee o >|, nace 


In this expression « and « are taken as independent variables while the operators in front of the 
matrix element are in normal order. Therefore we obtain 


Q = :et dat ad/0a—ata. <0| 7 Q efat \o>| 


a=a=0 


This is not quite the desired result where we would like to find the matrix elements of Q between 
normalized coherent states, i.e., states obtained from the vacuum by the action of the unitary operator 
e#a'—aa However, the relation 


etal — na |) = e~ ai/2 gia! la» 
enables us to rewrite 
Q=: 2! d/a+ad/a%—a'a. pad <o| exa—aat Q eat —ca jo>| 


a=cZ=0 


Moreover, we may commute e* with the derivatives acting to its left in order to be able to use the 
condition « = 4 = 0 at the end. Under the normal product sign a and at commute and we note that 
e4’ “0x translates a by an amount a‘. These remarks lead to the final expression 


Q = :e0" 0/da+ 00/08. <o| eta" ¢) pfta'—aa 10> |, ee (5-34) 


It would, of course, be incorrect to conclude superficially from (5-34) that the sole knowledge of 
diagonal matrix elements of Q in the coherent state basis is sufficient to reconstruct the operator 
itself. As the formula indicates x and & have to be taken as independent variables in order to carry 
out the required derivatives. 


Returning now to the physical Fock space with infinitely many degrees of 
freedom we can write the unit operator as 


co 


imi. oe 
I= ¥ | es Agape oe (5-35) 


p=0 


where the initial term is understood as the vacuum projector |0><0|, given by a 
generalization of Eq. (5-33), namely, 


10><O| = : exp E {a ash) an(| (5-36) 


so that the S matrix can be represented as 


Ss = exp [ak aa } a a din(k) a5 


x <O| exp [or [ain(k)a(k) — af,(k)a(k)]S 


(5-37) 


x exp [ek [af,(k)a(k) 


210 QUANTUM FIELD THEORY 


in complete analogy with (5-34). Due to our normalization convention the measure 
d°k and not dk appears in the coherent state exponent. We shall now combine 
this result with Eq. (5-32) applied to the case where the B are set equal to unity. 
Introducing a source on the right-hand side of this equation it takes the simple 
form 


e(k)-- - &(k) [ai(ky 5 eae kp), ole | 


-s{i[Jenercnenig, 


Since the vacuum is stable, that is, $ |0> = |0>, it then follows that 


Sr expi [as j(x)e(x) 
j=0 


<0| exp | d*k[ain(k)a(k) — al,(k)a(k)]S exp | dk [af(k)a(k) — ain(k)o(k)] |O> 


= exp {| ay &(y)(C, + m? \5 ry sao T exp i | aeione 


We have written &(y) for the solution of the homogeneous Klein-Gordon equation 
expressed as 


dy) = [ee [o(k) ef? + a(k) e*7] 
Inserting this matrix element into Eq. (5-37) we see that 


S =:exp {ary Gin(y)(O, + mo <0| T exp i | aio x)|0> (5-38) 


dj(y) 


We have naturally been led to introduce the free in-field with its Fourier de- 
composition [reciprocal of (5-30) | 


j=o0 


oad) = [ak Lane ® + als 7) 


We should remind the reader that a proper normalization of the field @ requires 
its replacement by Z~ ‘7g in the last T product. The outcome of this lengthy 
algebra is a compact relationship between the S matrix and a generating functional 
for the interacting field Green functions 


Z(j) = <O| T exp i | d*x @(x)j(x) |) (5-39) 


This bears some formal resemblance to the external source problems considered 
in Chap. 4. The difference is, of course, that we do not have any simple closed 
expression for Z(j) in the present case. 


The functional Z(/) lies at the heart of field theory. We might be tempted to think that it contains 
far more information than is needed to compute the on-shell transition amplitudes. With present-day 
techniques, however, this is not true for several reasons. The full Green functions are needed to 
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remedy in a consistent way the ultraviolet infinities occurring in the perturbative approach. Off- 
shell equations are also required to discuss bound states. Furthermore, short-distance behavior of 
interactions is ideally studied in the field theoretic framework. Nevertheless, notable achievements 
are to be credited to the S-matrix approach which abstracts from a relationship such as Eq. (5-38) 
only very general properties of scattering amplitudes and proceeds from there. Section 5-3 will give a 
simple-minded sketch of this line of thought which lies beyond the scope of this book. 


5-1-5 Connected Parts 


Some disconnected parts in the transition amplitudes (but not all) were not shown in Eq. (5-28) but can 
now be thoroughly studied with the help of Eq. (5-38). In simple terms some subsets of particles may 
interact independently from each other in a collision process. It is therefore natural to look for a 
definition of connectedness in such a way that an S-matrix element is the sum in all possible ways of 
products of connected ones involving subsets of interacting particles: 


<pP|S\a>= > _, Lf Pal Slay, (5-40) 


i,=!, HEL 


We use a shorthand notation for a collection of indices, | p,;> = lees a'(p;)|0>, «(p)) = Lie a(p;). The 
connected matrix elements are defined recursively by the right-hand side of the equation where the 


sum runs over all partitions of the set of initial and final particles. If |J| denotes the number of 
elements of J, we define 


s(,g= } = TI 43p: [] 424; «(p1)< pil S*|qs>a(4s) 
[I ! |J |! jeJ 


ih, 1st ie (5-41) 
= <0| exp \@p a(p)a(p)S‘ exp [ea c(q)a"(q) |0> 
with s°(0, 0) = 0. Similarly, 
s(a, &) = <O| exp [ep a(p)a(p)S exp [ea d(q)a‘(q)|0> (5-42) 
such that s(0, 0) = 1. The recursion relations (5-40) are simply interpreted as 
s(a, &) = e\*4) (5-43) 
If we use the definition (5-39) for the generating functional Z(j) we may write 
r) : 
s(x, @) = exp | |e 2k (27) a(ka(k) + [es a(y)(C, + m?) saz 7” (5-44) 
ie 
Comparing (5-43) and (5-44) it is natural to define the connected Green functions through 
Z(j) = eS (5-45) 


Since exp a) d*y &y)(C, + m?) 6/dj(y)] is a translation operator, we find by combining (5-43), (5-44), 
and (5-45) the relation 


S*(a, &) = {os 2k°(2n)?a(k)a(k) + G.[(O + m)a] (5-46) 
Let us make this formula more explicit. In the same way as we write} 


+ For the 2-point function this differs by a factor i from the standard choice in the free-field case 
[compare for instance with Eq. (3-90)] but it is convenient in the general discussion. We hope that 
the reader will not find it difficult to convert from one convention to the other according to the 
context. 
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wo : 


= <O|P expt [es e(xXj(rx)|0> = ¥° a ats “Ae ily) JCM.) (5247) 
0 Nn: 


with Got, 0-5%_) = <0] Tees): ++ @(%n) [0 
let us define 
G.(j) = - {ats Xy°** re Oay *H(Xn)GA(X1,.- + Xn) (5-48) 
1 


Then the meaning of Eq. (5-46) is that 
ore) rh 

Soya [er 2k°(2n)3a(ka(k) + = ley, sd *yq Gys)*** H(Yn) 
1 LU 


x (Oy, ate m*)"*(Giy, ate m?)Ge(y1,--+» Yn) (5-49) 


If we insert the definition of & and the expansion (5-41) we see that for n =|/| +|J|> 2 we obtain 
the basic reduction formulas 


{I lJ] n 
<p1|S*|qs> = rl dy ay, exp (ix Disa Viaaad yy ay-duies) TI I] ¢ am +m? GAGiee) (5-50) 
1 1 1 


expressing connected S-matrix elements in terms of connected Green functions. When compared to 
Eq. (5-28) we see that obviously disconnected terms have disappeared. Extra disconnected ones have 
also been suppressed as G, has Leen substituted for G. Recall that the correct normalization of @ 
requires dividing the right-hand side by Z”?. The case n = 2 has been set aside. According to (5-40) 
we have 


<p| S*|q> = <p| S|q> = 2p°(2n)°5°(p — q) (5-51) 


Moreover, we have to assume that G(x) = 0, meaning that the vacuum expectation value of the field 
vanishes. Using the Kallen-Lehmann representation (5-17), the two-point Green function may be 
written 


Gx, y) = G(x, y) = <0] Te(x)e(y)|0> 


© d*k 1 
=; d 2 2 Se STS) 
|, Boole fo k?—p?+ie- 


implying that G,(x, y) depends only on x — y and has a single pole in the Fourier transform variable 
at k? = m?. This means that 


[an | ety. eh as (O),, ar m?)(Oy, ate m)GAy1, y2) = 


Hence, only the first term on the right-hand side of Eq. (5-49) contributes to <p| S‘|q> and we can 
readily verify that relation (5-51) follows. Even if G(x) # 0, translation invariance requires that it be 
x independent so that G,(x, y) differs from G(x, y) by at most a constant and the preceding reasoning 
still applies. The reader will have no difficulty in proving that 


Cixi,..., xv >. [| GAx1,) (5-52) 

vVi=I a 
where J = (1,...,). Thus a natural definition of connected S-matrix elements has led us to a definition 
of connected Green functions independent of any diagrammatic expansion. A more elaborate 
discussion will be given in Chap. 6. We note that sometimes the convention that Z(j) = e@) is used. 
Let us conclude by noting that in an elastic two-body scattering process the separation into connected 


parts just corresponds to setting S = J + iT and retaining only T in the transition amplitude used to 
construct the cross section. 
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5-1-6 Fermions 


We generalize the preceding formalism to the case where spin 4 fermions, described 

by the interpolating spinor field w(x), are present in the initial or final states. The 

normalization constant affecting the field is traditionally called Z, in electro- 
‘magnetism so that in the weak sense 


W(x)—> 22!" Woulx ) (5-53) 


t-- to 
The vacuum expectation value of the anticommutator has the form 


iO] {Walx)s Waly} (OD = iY [<0] Wa(0) [><] Hp) [0 eo PO 


+ <O|Wp(0)|n><n| y_(0) |0> es &-7] 
The matrix spectral density 


Pap(q) = (21)° ¥ 54(pn — q)<O| Wa(0)|n><n| We(0) |O> 
can be expanded as a combination of the sixteen independent 4 x 4 matrices. 
From relativistic invariance it follows that 


(q) = pilq?)d + p2(q7) + p3(q)dy* + palq?)y? 


If we deal with a theory invariant under parity, as is the case for electromagnetism, 
a unitary operator F exists such that 


PIO>=|0) PW(A)P™* = VHX) 
This leads to the condition 
p(g=7°p(ay® gg = (4°, — 9) 
and requires the terms involving y° to vanish, that is, p3 = pa = 0. Consequently, 
p(q) = p1(q?)d + p2(q?) = pig?) =0 if q? <0 


From PCT invariance the antiunitary operator © leaves the vacuum invariant 
and acts on the fields as 


Oy.(x)O~! = ySph(—x) — OPAxO7* = — iyo —X)(° "Ip 


This enables us to relate the second term in the anticommutator to the first 
according to 


<O| Va(v)Walx) |O> = — 73.40] We —x)Pe(—y) O72 


Hence 


i<O| Wal), Wa(y)} |0> = ae ; 0(q°)[p1(47)id x + p2lq7 hap 


x (ene) = ga (ey) (5-54) 
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Using the support properties of pi(q’) on the positive real axis we find a super- 
position of free-field anticommutators: 


i<O| {Wi7(x), Wi2()} [OD = —(i9, + m)A(x — y, m) = S(x — y, m) (5-55) 


; a*k —tk + (x—y) ik « (x—y) 
hex ym=i| aesle =e ) 


In the interacting case we find 


i<O| {Wa(x), Wa(y)} |O> = — ‘a du? [py(u?)idx + p2(u?) Jap A(x — y, Ww) 


‘a du? {p1(u7)Sap(x — y, p) 


+ [up 1(u*) — pr(u?)}oapA(x — y, »)} (5-56) 


It is understood that y is the positive square root of u?. An analogous formula 
holds for the propagator with the same spectral densities p, and p2: 


ak | gaia Kp.(u?) + p2(u?) 
<0| TW(x)W (y)|O> = a | ate Boi in du Z 2 — ee (5-57) 


From the relation p'(q) = y°p(q)y° it follows that p, and p2 are real. Moreover, 
since the quantities tr [y°o(q)] and tr [y%¢ — u)p(q)(¢ — u)] (with q? = py’) are 
proportional respectively to the positive quantities }°,|<O|W.(0)|n>|? and 
Ya | <0] [9 — 2 we have the following positivity conditions: 


piu’) >0 — wpi(u*) — pa(u’) = 0 (5-58) 


Finally, it is possible to extract the one-particle contribution. Using invariance 
under parity, we come to the conclusion that up;(u*) — p2(u?) has a support only 
on the continuum states, so that in (5-56) the first term only receives a contribution 
from the discrete one-particle state for 1 = m. This assumes, of course, that we can 
isolate a one-particle state from the continuum, and is not valid in the strict sense 
with massless particles such as photons present. The small fictitious mass of the 
photons will therefore be removed at the final stage of the calculations after coping 
with possible infrared divergences. Otherwise the single (charged)-particle pole 
becomes a gauge-dependent branch-point singularity, and the treatment gets 
slightly cumbersome. 

Assuming, therefore, that p, contains a term of the form Z6(u? — m?) and 
that m, stands for the continuum threshold, 


<0] (W(x), WO)} |O> = Z2S(x — y, m) — | An [p (Hu? )idx + p2(u7)JAC — y, uw) 


(5-59) 


If we let x° = y® the left-hand side reduces to the canonical anticommutator 
id°(x — y)y%. Using the identities 
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A(x — y, W)|xxxyo =O = ApA(x — y, p)|xo=yo = —5°(x — y) 
it therefore follows that 


l=Z2+ | ; du? ps(u’) 


which, combined with positivity (5-58), leads to the same conclusion as in the 
scalar case, namely, 


02751 (5-60) 


From this point, we can proceed to the reduction formulas as in Sec. 5-1-3. Dirac 
operators replace Klein-Gordon ones. The free field has the Fourier decomposition 


= (ea oy sissy es 
Win(x) aa |G k° 2 [din(k, e)u(k, ele ae di,(k, e)v(k, 2) é ] 


with fixed helicity spinors satisfying v(k, €) = Cu’(k, ¢). The creation and annihila- 
tion operators are expressed as 


bin(k, €) = | dx ia(k, 8)e*** yWin(x) 


t 


di,(k, €) = \ d°x o(k, €) eo *** yWin(x) 
(5-61) 
bia(k, €) = \ d°x Wink, e)y° e~ ** *u(k, &) 
t 
dink, é) = \ d°x Wink, e)y° (eis ; v(k, é) 
t 
A sign ambiguity affects one-particle states of given helicity and momentum. 


Ignoring finite size wave packets we can write for an electron, say, in the initial 
state [compare with (5-26)] 


<out| bi,(k, e)|in> = lim Zz"? [es Cout| W(x)y® |in> e7 ** *ulk, €) 
= Cout| biu(k, €) |in> — iZ~ 1” [ex cous W(x) |in> 


y i apy - 1 Hh soe 
x = B0y°ulk e)e"*** + Cout| p(x) |in> = 80) ulk, Be = | 


The spinor u(k, ¢) e~ *"* satisfies the homogeneous Dirac equation, so that its time 
derivative can be replaced by space derivatives. Integration by parts leads to 


<out] bin(k, @) jin = dise. — 123" [ats <out| P(x)|in(— 74, — m)ulk, &) e“™* 
(5-624) 
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Here disc. stands for a disconnected term and (5-62a) expresses the result of 
reducing one particle in the initial state. We have corresponding expressions 
for the reduction of an antiparticle in the initial state, or for a particle or anti- 
particle in the final state. These read successively : 


<out|di.(k, e)|in> = disc. + iZz 1? [as d(k, 6) ee” *** (i¢,, — m)<out| (x) [in 
(5-62b) 


Cout| Dour(k, €) |in> = disc. —iZz \/ [atx u(k, e) e***(i¢, — m)<out| W(x) |in> 
(5-62c) 


Cout| dour(k, €) |in> = disc. + iZz */? [at <out| W(x) |in>(—i ey — Mt(kes) eo 
(5-624) 


When comparing (5-62a) and (5-62d), corresponding to an electron entering the 
interaction process or a positron leaving it, we observe the characteristic 
substitution 


u(k, e) e~ *°* + —v(k, e) e*** 
discussed in the hole formalism (Chap. 2). The positron in the final state is 
equivalent to an electron with an opposite energy momentum in the initial state. 
The same remarks apply to the pair (5-62b) and (5-62c). 

It is now easy to continue the reduction process, using the same trick as before, 
namely, a time-ordered product, except that we have to keep track of the anti- 
commutation properties of fermion fields. To simplify the general expression we 
omit the explicit polarization dependence of spinors and operators. 

Assume that particles labeled (k,,...)and antiparticles (k,...) are the incoming 
ones, and particles (qi,...) and antiparticles (q/,...) the outgoing ones. The 
conjugate space-time variables are respectively denoted x, x’, y, and y’. Finally, 
nand n’ stand for the total numbers of particles and antiparticles. The scattering 
amplitude takes the form 


out| er dout(qs)* a Bout(q1)bia(k1): _ di(k,)*- a |in> 
= disc. + (—iZz */?)"(iZz V2)" [ax vee dtyt-- 
x exp[—iX(k-x +k -x’—q:-y—qd-y)|-- 
X fH(qs)(i Gy, — mm) HKG, — m)<O| TE D4) + Worle)“ WX) “J 0 
x (Zoe — mula) a i dy, — m)v(qi)°*: (5-63) 


By changing perhaps the sign, we can of course reorder the w and w fields under 
the T symbol. 
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The generalized Green functions are the vacuum expectation values of the 
time-ordered products of fields. The underlying hypothesis is that to each species of 
particles there corresponds not only an asymptotic free field but also an interacting 
(or interpolating) field. Some of these fields could be composite in terms of the 
elementary lagrangian variables if bound states can be generated. This is a non- 
trivial situation which requires a thorough investigation. Such bound states 
manifest themselves as poles of some Green functions in the corresponding energy 
momentum variable. 


Define the connected Green functions for processes involving fermion fields and discuss the corre- 
sponding generating functionals. 


5-1-7 Photons 


We expect additional complications in the case of photons because of gauge 
invariance. Following the Stueckelberg approach we introduce a fictitious mass yu 
and an indefinite metric Hilbert space. Physical vector photons have the mass pu 
while the scalar ghost states have the mass m(u?/m? = A). The interacting field 
satisfies the modified Maxwell equation 


(ade — (1 —A)o"0- A= iP (5-64) 


while the conserved current j vanishes in the case of free fields. The field conjugate 
to A, is 


n? = G°A° — B°A? — Ag? A (5-65) 


Consider the vacuum expectation value of the field commutator. Inserting a 
complete set of positive energy intermediate states we obtain the general form 


co Oa ae ae rie 
<O| [A,(x), A,(y)] |O> = -| dM? | aire k-(x-y) _ pik (x—y)) 


x Ee 2\9 5) — T2(M”) te (5-66) 


It is understood that k° stands for (k? + M?)!/? and the lower limit of the spectral 
measure support is larger or equal to inf (u?, m). As a consequence of current 
conservation the scalar component is a free field 


(0 + m’)d-A=0 (5-67) 
We therefore write that 
<0| [(D + m*)@ «-A(x), Av(y)] [0 = 0 
This yields the relation 
(m2 — M?)[72,(M2) — 12(M?)] = 0 


7 ; 2 
which means that 7, and 7 differ at most by a term proportional to 6(M? — m’). 
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For convenience we set 


7\(M?) = 2(M7) 
12(M7) = n(M7) — <x 6(M? — m’) 


Hence 


ee aek a Se — 
<0| [A,(x), A,(y)] = -| sails | sere k-( en ») 


k ky 2 ili. 
(eM) 28eawe] 


which exhibits a transverse part weighted with n(M7) and a longitudinal one 
concentrated at M* = m?. 

We insist now that the equal-time commutator vanishes. On the right-hand 
side the corresponding contribution is proportional to a gradient of a 6 function. 
Setting its coefficient equal to zero yields the sum rule 


co ed 
Z32 = | dM? n(M2) <a (5-69) 
0 


Next we evaluate the equal-time commutator of the potential with its conjugate 
field. Using Eq. (5-65), a simple calculation yields 


£0 [A p62) 2%(y)] |0>|a0= yo = 153K — res \s dM? x(M?) 


+ 9°98 |Z _ ii dM? “ty! (5-70) 
0 


We would like this commutator to be canonical, which means equal to ig}6°(x — y). 
As far as space components are concerned this is achieved by setting 


| dM? n(M?) = 1 (5-71) 
0 
In order that this result extend to the time components we would have to 
satisfy 


Z3z= | dM? n(M2) 
0 


This is, however, incompatible in general with Eq. (5-69). We are going to see that 
m(M7) is a positive measure. Therefore the validity of both relations leads to a 
measure of support M? = p? and can only hold for a free field. The best that can 
be achieved is to allow for different normalizations of the space and time canonical 
commutation rules, i.e., to assume that 
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[A,(x), n*(y)] | ,0 = yo a id°(x a y)(g," ar ag p09’) 
(5-72) 
oO We _ M2 
a=Z3z—-1 =| dM? n(M*) —.— 
0 M 
Up to now, no reference has been made to the asymptotic condition. Since 
dynamics does not affect the various parts of the field in a similar fashion, it 
would be clearly unwise to assume that (even weakly) A,(x) tends for x° + —co 
to a free field up to normalization. It is more realistic to define a transverse 
component of this incoming field as 


: : 1 ; 
Ap''"(x) = Ap(x) Heed 0,0° A(x) (5-73a) 
and to assume 
. 1 
AjGr-— Z 3” [as Sie me 6,0° 4a) | (5-73b) 


This condition entails in particular that 
1 : 
COWARGS |! > S924 «ol a(x) | > — 27/2 mi ,<0| + Ai"(x) 1 | (5-74) 


For the three transversely polarized states only the first term contributes, while 
only the second matters if we deal with a scalar state. The one-particle contribution 
to the commutator can then be worked out as 


d*k Sih = (eS ike (x— 
<0| [A,(x), Ar(y)] o— = —-Z3 {ave | seer a 


We \ <2 Kpks 
x owe = (op = “) a We 6(M? — m?)| (5-75) 


If M2 stands for the continuum threshold we obtain 


<0| [A,(x), Av] |0> 


d>k r ‘) 
— = yp atk (x= y) 2 tks (x= y) yo 
Z3 | gore (e é ) (0, ie 


d*k —ik+(x—y) ik -(x—y) k pky 
- 232 | roar ls 


k= (k? +p)? 


ko = (k? + m?)*/? 


as d*k ee k pky 
-| dM2 ni) | ape fee (xy) _ ek: ( ») @ — tas) 
MB 


k° =(k? te M?)'2 
(5-76) 
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i=7e | dM? n(M?) (5-77) 
Mj 
0 ie 
Z(z—1)=| dM? (M2) —; (5-78) 
M3 M 


Accordingly, the deviation from canonical behavior is due only to the continuum 
contribution and vanishes for a free field since (5-72) reads 


00 2 M? 
= | dM? 1(M2)=—_—— (5-79) 
M2 M 
The corresponding decomposition for the covariant propagator is 


aa, I pv — Kykv/p? k oky/p? 
ESO] Tea ah >= |e 43 k? — yp? + ie i ie 


Ss v — kyk,/M? 
= i, dM? n(M7) a (5-80) 


(see the discussion of the problem of covariance at the end of this subsection). 
Even though the longitudinal component is dynamically decoupled it turns out 
that the covariant expressions written above imply a nontrivial normalization 
factor Z3z for this term; Eqs. (5-77) and (5-78) do not allow us to set z = 1 or 
Zaz — | 

As soon as we have shown that the measure 2(M7) is positive, in spite of the 
indefinite metric state, the physical interpretation of Z3 will become clear, as it will 
play a role analogous to Z in the scalar field case. 

To do so, we extract from Eqs. (5-64) and (5-76) the vacuum expectation 
value of the current commutator: 

d°k 


<0| Li p(x), iv(y)] |o> a \. dM? | soos (ee (xy) __ elk . eo) 


kok 
x 1(M’)(u? — M*? (ay - i) (5-81) 
This formula exhibits (1) the disappearance of one-particle contributions: 
<0| jl photon> = 0 and (2) the current conservation. In a space with definite 
metric we would easily derive from (5-81) that the density 2(M7) is positive. 
However, we have here 


Y. 6(p2 — M?)(— 1)"=<O] j,(x) |a><al j,(y) | 


d*k ER ie (ac k k 
= -| sia k + (x—y) n(M7)(y2 = M?)? (0% = a (5-82) 


where n, denotes the number of scalar photons in the intermediate state |x> and 
<O| j.(0) |a>* = <a j,(0) |0>. In Chap. 4 we have encountered examples of cancella- 
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tion between longitudinal and scalar contributions. Here we expect a different 
mechanism since the two kinds have different masses. We first note that states 
containing one scalar particle do not show up in (5-81). Furthermore, Eqs. (5-64) 
and (5-76) indicate that 

| <O|Lio(x), 0 A(y)] |0> = 0 | (5-83) 
This relation can readily be extended to other matrix elements, so that more 
generally we have the operator relation 


Li(x), 0° A(y)] = 0 (5-84) 
which expresses the gauge invariance of the current and reminds us that 0: A is a 
free field: 

@- A= (2%,)"!? 0- A = (2H 5)'/70- A™ (5-85) 

Combining (5-84) and (5-85) it follows that the matrix elements <O| jp(x) «> 

such that |x> includes scalar photons have to vanish. This shows that all terms 
on the right-hand side contribute with a positive sign and 

n(M) > 0 (5-86) 


Thus the interpretation of the sum rule (5-77) is the same as in the scalar case. 
A closer study will reveal that Z3 is in fact independent of the gauge parameter 
A, as the preceding remarks suggest. 

Deriving reduction formulas for states involving transverse photons is then 
straightforward. Let us consider, for instance, a process involving the emission 
of a photon with momentum k and polarization ¢ (ek = 0): 


<B, k, €, out| a, in> = ¢B, out | af}.(k) | «, in> 
=—-i | d>x e** 3, <B, out|e> A? ""(x)|o, in> 
ty 


An asymptotic relation analogous to (5-73) with t=t,— oo and A replaced 
by A’ holds. The above matrix element can therefore be written 


es 1 : 
lim | d*x e*'*Go ¢B, out|e- A(x) + — 8° 0 d- A(x)|a, in) 
ty 


t,o 
1 
disc. =iZ3°@ [as eM (1, + w7)<B, out|e- A(x) saa 6°08: A(x)] a, in> 
Now 


9) eae 
(Ox + p7) = 0,0° Aw) = e = 8, 0: A(x) = (A - 1)0,0° A(x) 


The combination occurring in the matrix element is therefore ¢- j(x) and the final 
expression, with a possible disconnected term, reads 


<B, k, &, out 


a, ins dive. Sa7Zge!? [ats e**¢B, outle-j(x)|a, in> (5-87) 
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This procedure can be carried out again and again. We quote the result pertaining 
to the case where an initial photon (k;, €;) is scattered to a final state (ky, €r). We 
get 


CB, ky, €f, out |o, ki, 6, in> = dise. — Z5' [atxaty elke eke WT], + ps?) 


x <B, out| Ter > j(x) ei A(y) + 4 &° 0 0: a0) | in> 


After the action of the Klein-Gordon operator we would like to replace the 
matrix element by <f, out| T[e,-j(x)ei*j(y)]|a, in>. The two quantities can at 
most differ by a distribution concentrated on the manifold x = y, due to the 
ambiguity arising in the definition of time ordering. We adapt our definition of 
the T symbol in such a way that the two expressions are identical: 


<B, Ky, Ef, out | a, ki, Ej, in> 
= disc. — Z3! [as d*y gilkerx-K¢R out! Ter -f(x)e“j(x) le, in> (5988) 
This formula will be used for an elementary discussion of the Compton effect. 


We return to the proper definition-of covariant time ordering since we skipped any detail between 
the expressions (5-76) for the commutator and (5-80) for the covariant T product. Things are not so 
simple when dealing with tensor operators, and so deserve some comments. For the purpose of 
illustration let us study the product of two conserved currents. Denote by p(M7) the spectral function 
(u? — M?)?n(M7) so that 


Worx — y) = <O| j.(x)jr(y) |O> 


=| ape p(M?) Be O(k°)5(k? — M?) e~ BMF slo (5-89) 
3 (2n)° ie 

Current conservation is explicit and the minus sign in front of the right-hand side follows from 

our Lorentz metric in agreement with a positive p(M7). Let 


4. 


d*k 
Wax mm y) = low 


5(k? — M?)6(k°) e~*- 8» = CO @(x) gy) }O> (5-90) 


be the Wightman function of two scalar free fields of mass M. For the commutator we obviously 
have 


<0| Linx), jy)] |0> = W(x = y) a, W, ly aa x) (5-91) 
We shall first attempt to define a naive time ordering (denoted T) as 
Ol Tiplx)jvy) OD = A(x? — y°)<O] jp(x)jv(y) OD + O(y° — x°)<O} jy) j,(x) OD (5-92) 


Unfortunately, using (5-89) we shall soon discover that T does not lead to a covariant quantity. 
For a free field of mass M, 


i€O| Te(x) p(y) 0> = i8(x° — y°)Walx — y) + i6(y° — x°) Waly — x) 


(5-93) 
--[aene 


Gry MoM sig OV *) 
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Consequently 
i<O| Tj,(x)j(y)|0> = — Ir dM? PM) -y “(an = ee) Bu =e won| (5-94) 
We define a covariant T product as 

i€O| Tj(x)jo(y)|0> = IT!(x — y) + iATpo(%, y) (5-95) 


with 


iT (x — y) 


= {. dM? pio) ( gp 
M5 


= -| dM? PM?) (9 - 
M5 


axa? 
rE Jae — y°)Walx — y) + aoy| 


xaAy 


asa 
ave Gr(x — y) 


oy ae Jov — kyky/M? 

=| dM? p(M?) | —7e ®-» 5-96 

Me mt {an k? — M? +i 7) 

Here iAT,,(x, y) 1s a local (i.e., concentrated on x = y) covariant term which we leave undetermined 

for the time being. We shall see that it can be adjusted to take care of current conservation. Let us 

study the difference between the two time-ordered quantities. To do this we rewrite the term under 
square brackets in Eq. (5-96), using the fact that W,,(x — y) depends only on (x — y)’. 


O70 
[]= (ae = re Jaca — y°)Wu(x — y) + (xy) 


070% 
= [600° “= (0 sire ee hat ot ix ~»| 


+ A 9 905(x° — y°)8%[Wa(x — y) — Wuly — x)] 


From canonical quantization it is clear that 
ig p05(x° — y°)83[ Wax — y) — Waely — x)] = 9p09v05"(y — x) 
Therefore 


p(M”) 
M? 


IT\(x — y) — iXO| Tj,(x) jy) |0> = —gp09v0d*(x — y) \. dM? (5-97) 


M5 


The difference is a contact term concentrated on x = y but not covariant. 
Let us use this expression to study an interesting property of equal-time commutators, or rather 
of their vacuum expectation value in the present case. It follows from (5-96) that 
2 Z ik-(x-y) fe, 1 — ke/M? 
=] ” aM M - Ear ner 
Mi p( ) = ane k2 —_ M2 ae ig 


—6,64(x = y) ie dM? p(M?) 
Mj 


GTS (x — y) 


Il 


M2 
According to Eq. (5-97) this must be equal to 


aoe i) 


—gvo00d"(x = y) l dM 


The current is conserved so that the divergence of the naive time-ordered symbol must equal the 
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equal-time commutator ; hence 


: M?) oe 
—(8y = gvod0)5*(x — y) | aes ee = 15(x° — y°)<O| Lio(x), j(y)] 0 


My 


which means 
<0| Lio(x), jo(v)]0>|.0=,° = 0 Me? (5-98) 
COl Liab) jy ODfarey = 146%(x — y) (am OLD 


Mj 
The fact that a gradient of a delta function appears in the equal-time commutator of a time and 
space component of the current, as a consequence of covariance and positivity, was first noticed 
by Schwinger © hence the denomination of the c-number Schwinger term to recall that it even occurs 
in the vacuum expectaiion value. The vanishing of the equal-time commutator of two time components 
of the currents reflects the fact that integrated over space they generate the conserved electric charge. 
If the theory allows for a larger invariance group than U(1), the currents would carry internal indices 
corresponding to the various generators of the symmetry, and the discussion of equal-time com- 
mutators would involve the Lie algebra of the group (see Chap. 11). 

We can now return to the term iAT,,(x, y) occurring in Eq. (5-95) and chosen to satisfy 


i02<0| Tj,(x)j(y) |0> = 0 (5-99) 
The choice 
(ee) M2 
iAT,.(x, y) = Jpvd(x — y) [ dM? sil (5-100) 
J Mi M 


satisfies all the requirements, so that the complete expression of the covariant T product is 


; =| a*k : kg ov — Kpky 


—ik + (x—y) 
M? J (2n)* k? — M? —ie 


i<O| Tj,(x)j(y) |0> = | aM (5-101) 
Mo 


5-2 APPLICATIONS 


Before presenting in the following chapter the covariant perturbation theory, let us 
illustrate the methods developed so far on some simple examples involving 
photons and charged fermions. 


5-2-1 Compton Effect 


We computed in Chap. 1 the classical elastic Thomson scattering of light off a 
charged center. In 1923 A. H. Compton, in his study of X-rays, discovered the 
scattering process with the frequency shift which carries his name. 

To be specific the target is taken as a free electron. We shall base our reasoning 
on formula (5-88) to compute the amplitude of the effect to lowest order in e, the 
electron charge. To this order Z3 is equal to one and the current can be identified 
with the free electron current 


Jolx) = & : Winlx)y pWin(x): (5-102) 


Henceforth we drop the index “in” and identify the in and out electronic states 
which means that the amplitude is evaluated to order e?. The process itself is 
depicted on Fig. 5-2 where wiggly lines represent photons and solid ones electrons. 
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Bpuey 


Pps OW P;,% Figure 5-2 General kinematics of the Compton effect. 


The connected S-matrix element (c stands for connected) is 
ye a? | fers d*y elke) Ti (x)bW(x): -WO)EiW(y): [pe (5-103) 


The time-ordered operator is expressed by Wick’s theorem in terms of normal 
products involving as coefficients the contractions 


a ale z d*q ete} 1 
G ah) = = er a =1 STN asd 
Wx\Wely) = —Wely Wax) = ISie(x — y) =i | ant ° ( ee ee 


ae" 
Welx)We(y) = Vx) We(y) = 0 
Let us apply Eq. (4-75); we omit contractions between operators referring to the 
same point because of the normal ordering of the current. Thus 


T:Y(x)y W(x): WOW): 
= 1: W(x)ppS¥(x — yy Wy): + i: WO) yS*(y — xX)ypW(X): + °° 


The remaining terms do not contribute to the connected matrix element. We 
expand the free fields w and w in terms of creation and annihilation operators 
[Eq. (3-157)]. If «; and «, denote the initial and final electron polarizations 


|pi> = b*(p;, «4) |0> ps> = b'(Ps, x4) |0D 
we have typically to evaluate 
- d°q; d°q. m’ 
. - bt(n-. a: = ——, —= 
<0| b(py, ay) : Wx) rs. (Di, 0;) \0> (a (Q2n)3 qoqs 


“ » Ue(41, %1)Un( Ga, a2) <O| b(py,&7)"(q1,041)b(q2,%2)b "(qi ai) |O> 


@1,a> 


elldi> x— 42°») 


= e(Ps°*— Pi DZD, of) Uy(Dis Oi) 


This allows us to compute S;__; as 


SiS ier |) dey et 2h») aq 
ae | a 


x ett me u(ps, 4. ae i ee 04) 


£ulDi, a)| 


ae eilPs— 9° y—i(p,—4)- x u(pr, ot p Bi 


1 
q@—m- ie 
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We perform the integrals over x, y, and q, so that 
Shi = —ie?(2n)*d*(ky + pp — ki — Did) 


2 1 . 1 } 
x Ea oper Pe Se fiu(Di, %) + U(Dy, Xy)éi 4,20 £pulpi a) 
(5-104) 


Integrals over configuration space variables have generated the energy momentum 
conservation 6 function. The ig prescription in fermion propagators can be dropped 
since by adding or subtracting a light-like vector (k) to an on-shell momentum 
we necessarily leave the mass shell. 

From the discussion given in Sec. 5-1, the coefficient in brackets is the reduced 
transition matrix element 


1 1 
a oe fet kya ty) a4) (5-105) 


The two terms occurring in 7 can be represented by Feynman diagrams (Fig. 5-3). 
External lines carry the polarization functions ¢, u, or u and the corresponding 
momenta. Internal lines correspond to the propagators. Vertices which represent 
the interaction ey are such that energy momentum is conserved at each of them. 
This explains why the first propagator corresponds to k; + pj = ky + py and the 
second to pj —kys =p, —k;. Total energy momentum is therefore conserved 
throughout the diagram. The limit » +0 for photons is trivial and it is easily 
checked that if one of the polarizations is longitudinal (index 3) the contribution 
vanishes. We can therefore limit ourselves to the two transverse polarizations. 

Using (5-105) let us compute the scattering of photons on unpolarized 
electrons when the final electron polarization is not detected. This amounts to 
averaging the probabilities on initial (electron) polarizations and summing over the 
final ones. Up to kinematical factors we therefore have to evaluate} )"2..«,|-7 pi|?. 
The scattering cross section is given by Eq. (5-13) with the appropriate modifica- 
tions due to the presence of an initial electron [flux factor (2/m)p;+k;] and of 
an electron in the final state [phase space factor m d*p,/(2z)*p?], leading to 
the expression 


TF 44 = —e? il(py, H,) (4 


1 m dks md°p 
do == ¥ (2n)t5 (py + ky — pi — ki) | F 5? —— ~ es = 
D » oe \7 ji 2p;:+k; 2k9(2n)> (2n)3p% 


Ky, €y k 


1? 


foe é; 


Pri Bt k Pi Dp Xp = Pike — Dj, 


Figure 5-3 Lowest order Feynman diagrams for the Compton effect. 
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Let us use laboratory variables with 
dk? dks 
d(p? + k) k?) 
where Q is the emitted photon solid angle measured with respect to the incident 
momentum. We have 
dpy + dk¥ 1 d(p$)* 1 27 __ Prk 
ie 7 ee. a” es iat y= poke 


| dps d>ky (pp + ky — pi — ki) = (k9? dQ 


Hence 


1 (kom)? 
do = a? — p> 1g ee ee IG) 
2! | Dit kips sky 


where the kinematical factor is derived from momentum conservation 
lab 
(pit ki)’ =(pp tk)? pitki = perky = mk? 


For photons we use the notation k = k° = |k|, and we are left with 


(e i dQ = (5-106) 


Rationalizing the denominators and using Dirac’s equation the matrix element 
can be rewritten 


2 


(0.4 


A 


U(D ys, Xp )éy £iu(Di, &i) 


————— 
Bit ki—m 


+ u(Dys, a,)éi 


1 
a ae fsu(Di, %) 


1 1 
Uu(ps, oy) (« ey, fit bi i as tr) dui) 


= u(p,, a(t aoe us Pe ot eu, ati) 
2(pi + kji)* 8: — gk; 2(p; — kp) er + 
= ipa) | fy APE Ag, Moe Baler SE Nae, 0 


The transverse photon polarizations ¢;, ¢, can be chosen orthogonal to the “time” 
axis p; and to k; (for ¢;) or ky (for e,). Hence the previous matrix element takes 
the form 
2 Htiki tits ky 
—t(ps, o,) (je ate Dok u(pi, Oi) 

We should have noted that the total contribution of the two Feynman diagrams 
is invariant under the exchange 

(Ei, ki) > (es, —ky) (5-107) 


permuting initial and final photon variables with a sign reversal of the momentum. 
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This is an example of a new symmetry of transition amplitudes, called crossing 
symmetry, which will be elaborated later (Sec. 5-3-2). Using 


), udp, aide (p, o) = (G*) (5-108) 


a 


we can now sum over electron polarizations 


7 pitm~pr+m 
= i(p;. o;)|7 = O 5-109 
X » | u(py, x)OU(pi, x) | tr (0 5 5 ( ) 


with O = y°Oty®, and the trace is to be taken over Dirac indices. In the present 
case we find the cumbersome expression 


= drt tity \ pit m (Kidity — Ketrti\ Be +m 
X16 (as ae fists.) 2m (pe a pe | om | (5-110) 


The trace can be divided into four terms: two diagonal ones, each of which is 
obtained from the other by the substitution (5-107), and two crossed terms. The 
latter are equal since the trace of an odd number of y matrices vanishes and 


tr (y1¥2°°* Yan) = tr (yan'** 271) 


as follows by transposition and relating y to y’ through the charge conjugation 
matrix C. Two nontrivial traces remain to be computed. We try to make the best 
possible use of the identities k? = kj = 0, e7 = e} = —1 by applying repeatedly 
the identity 44 = — bd + 2a-b to move the corresponding momenta adjacent to 
each other 


Ty = tr LépAiki(bi + m)Kidié (By + m)] = te [£ AK Di Kid By] 
= Qpi ky te (¥y Gilt: fit By) = Qi ki tr (Ep Kid By) 
= 8p," kee ky es+ pp + kis py) = 8p; "ki [2(er- ky)? + ky * Di] 
where at the last stage we have used momentum conservation which implies 


kis pp =Ky+p; and e¢* py = ep (pi tk; — ky) = éy*k;. (We have chosen é,* p; = 
&* p; = 0). Similarly, 


Ty = tr [£sfiki(Bi + mK sé é(bye + m)] 
= tr [frAiki(Bi + mK s£sé(Bi + ki — Ky + m)] 
= te [Ki(Bi + mK pb phit é(bi + m))+ 2k, tr (£:Ki Pik p) 
—2k;- er tr (Kipik sey) 
= 2k; p; te (Kp tr tit tipi) + 8(ky > &;)*k; * pi — 8(k;- Er)” Di "ky 
= 8k;° piky- pp [2(es+ 8)? — 1] + 8(kp-&)*k;- p; — 8(ki Es) ky Di 


The second crossed term T3 is equal to T,; finally, T, is obtained from T; by the 
substitution (5-107). Adding these quantities with the appropriate factors occurring 
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in Eq. (5-110) and using laboratory kinematics yields 


1  epinek: 3 
x= am af pee &) -2| (5-111) 
Inserting this in Eq. (5-106) we get finally the Klein-Nishina (1929) scattering 
cross section for Compton scattering off free electrons: 


do _ a? ky ky k; 
xa =e 2 (Z) K +; + 4(e,+ 6)? — -2| (5-112) 
The frequency shift is readily related to the scattering angle 6 by squaring the 
: four-vector equality k ; — kj = p; — pr. We get —2k?k?(1 — cos 6) = 2m(k? — kP). 
_ Thus 
k; 
1 + (k;/m)(1 — cos 6) 


In the low-energy limit k;/m—0, formula (5-112) reduces to the Thomson 
expression (see Sec. 1-3-2) 


do 

dQ 

Near the forward direction, i.e, when 6 +0, the Klein-Nishina expression also 

reduces to the nonrelativistic result (5-113) whatever the incident energy may be. 

If the incident photon beam is unpolarized and the final photon polarization 

is undetected it is necessary as in the case of electrons to average over ¢; and sum 
over é, to obtain the unpolarized cross section 


ky = 


a? ow = 
= Sere)? Sry = 28 x 107 om (5-113) 


do 1 do 
02> ao 
As in Chap. 1, DN », (ee rr) = 1+ cos? 6, leading to 
do a? ky x ky k; - 2 
—_ = —— 6 5-114 
dQ ~ 2m? ) é ae Ce 


| A final integration over angles (x = cos @) gives the total cross section 


mm noon F 1 i 1 1—x ! 

=e |, “Vfitkimd—opP  1+k/md—x [1 +ki/m(—x]? 
If oo denotes the total Thomson cross section and w the ratio of the initial energy 
i to the electron rest energy 


2 ’ 
(eee (5-115) 
3m m 


2 1+ 2@ 


B 3 (1+ @[2a(1 + a) In(t+20) 1430] (146 
ne a | ~in(1 +20) | B20) i + 20) ( ) 
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Figure 5-4 Ratio of the unpolarized 

0 Compton cross section to the Thomson 

0.001 0.1 10 1000 cross section as a function of the inci- 
w=k,/m dent energy w = k;/m. 


The low-energy (w <« 1) and high-energy (w > 1) limits follow from Eq. (5-116): 
P 1 — 2m + O(w”) wo<«l 


oo jy [in20+44 o(*2)] w >1 
8a w 


The ratio G/do is plotted on Fig. 5-4. 


(5-117) 


We have verified the crossing symmetry of the Compton amplitude to lowest order in e. The reader 
can extend this property to all orders by examining the expression (5-88). As shown by the reduction 
formulas of Secs. 5-1-3 and 5-1-4 this can be generalized to more complex processes involving two 
charged conjugate particles occurring one in the initial state and the other in the final state. The 
amplitude is invariant by interchanging the incoming particle and outgoing antiparticle and reversing 
their four-momenta. The corresponding amplitude is, of course, unphysical since it involves negative 
energies, but the analytic continuation of amplitudes can be obtained through the reduction formulas 
themselves (see below in Sec. 5-3-2). 


5-2-2 Pair Annihilation 


The preceding calculations may be adapted to obtain the rate of a related process, 
namely, charged-pair annihilation into two photons, the opposite process from the 
one studied in Chap. 4. The starting point is an expression similar to (5-88) but 
corresponding to the reduction of two photons in the final state. However, Bose 
statistics implies that if Q denotes the solid angle of one of the emitted photons, 
the total cross section will be obtained by integrating 4do/dQ over 4x, that is, 
do/dQ over 27. Of course, our expressions have to be symmetric in the interchange 
of the two photons. 

The kinematics is recalled in Fig. 5-5a where an electron (p;, «,) and a positron 
(p2, %2) annihilate to emit photons (kj, £1), (k2, €2). The corresponding amplitude is 


Spi= —Z3' | fats d*y eM **" Ol TLes + j(x)er*j(y)} [Pry 1, Das 2, in) 


Just as before, this is computed to lowest order taking into account the same 
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ky,€ Pye “i | ky, € ky, €) ky, € 


ky, €4 P2,% (b) 


Figure 5-5 Pair annihilation: (a) general kinematics and (b) lowest order Feynman diagrams. 


type of contributions. Straightforward calculations lead to 


Syj = —ie?(2n)*6*(ky + kz — pi — p2) 


1 
— oon + Ai hur, 1) (5-118) 


a 

2 Fi) i ey ee... 
Pi-ki be — kom 

showing a remarkable analogy with the corresponding Compton amplitude. This 

is exhibited in Fig. 5-5b, where the fermionic lines are oriented according to the 

flux of charge e. This amounts to a reverse orientation as far as the positron 


x B(p2, X2) \ 


- momentum flow is concerned. A simple substitution allows us to derive Eq. (5-118) 


ema AER CC LL LL: I I eR 


from Eq. (5-104) for the Compton effect. It reads 
(ki, €:) > (—k, €1) eae Obi) > as 1) pi Pr (5-119) 
(ky, &7) > (ka, €2) U(Ds, %s)—> 0(p2,%2) Pr —P2 
Crossing symmetry is replaced by the Bose symmetry of the annihilation amplitude 
and the substitution (5-119) yields another instance of the crossing relations 
between processes. 
From the matrix element (5-118) we derive the cross section for unpolarized 
electrons and positrons. The flux factor is (1/m?)[(p1-p2)* —m*]'* and the 
statistical average yields a factor of 4. Use is made of a relation analogous to 


Eq. (5-109) involving 7 
-¥, 0, aatp, «) =F ™ 


The cross section takes the form 


eG — =m'e* ie oki fs a giko¢2 \pi tm 
°F pay — my?) 4 WN 2pieki | 2pick) 2m 
3 3 
fikids #2kof1 \—Pot+m , — a’k, dk 
ie 7 See on (2my"d%ki + ka — Pi — Pa) 45093 DKR(2m)3 


The photon polarizations ¢, and ¢, have been taken orthogonal to p; (and ky or 
kz respectively). The trace is given by the substitution rule (5-119) applied to 


(5-111) 
tr) = 3 -(2 +#) + 4(€,* 82)" — 2| 
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We perform the computations in the rest frame of the electron. Thus 


enol.) ko’ dk9 
a AG CNT] an? aeaKe aT HD) 
with k? given by energy momentum conservation. Hence 
d(k? + k?) Lod . kirk 
sah a Lipa pees 77 oe a) Ma eek 
dee tee ag 2 — B= pay 
ky+k2, = m(m 4° E,)= k8(m + Es — | p2 | cos 9) 
Finally, 
do a? m+ Ez ky ky 
a — + — — 4(e,;+&2) +2 5-120 
Q TREE ue (e e2) ( ) 


yields the differential cross section for the emission of one of the photons in the 
direction defined by the solid angle Q, corresponding to a positron (E2, p2) 
impinging on an electron at rest. In (5-120), k,; and kz stand for the photon 
energies with 


kz _ E2 —|pa| cos 0 


5-121 
re - (5-121) 
To obtain the total cross section we sum over photon polarizations: 
2 
m 
(6, °&2)? = 1 +[1- (2+ *)| 
and integrate over half the total 47 solid angle with the result 
2 o. 
mro 6 |y7 +4y4+1 = y+3 
= | 4 jy? = SS 2 
with 
E 
ro = * y= a il 
m m 


In the two limiting cases y — 1 (threshold) and y > oo (high energy) we find 
Tro nr 
(=e mons y—> 1, v2 = positron velocity 
o=4%,, ? - (5-123) 
|x —[InQy)-1] ya 


The Born approximation used so far is a priori rather poor at low energies where electron and 
positron interact through long-range Coulomb forces. If, however, (5-123) is used as a first guess we 
find that the annihilation probability per unit time of a slowly moving positron given by 


w =o x incident flux x target density ~ ov»nZ = nZar3 (5-124) 
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is independent of its velocity. Here n denotes the number of atoms per unit volume and Z their 
atomic number. A low-energy positron in a medium has therefore a lifetime approximately given by 


t=w / (nZnrz)' (5-125) 


In lead, for instance, t ~ 10~'° s. The general formula (5-122) was first obtained by Dirac in 1930. 


5-2-3 Positronium Lifetime 


We have just studied the free annihilation of an electron and a positron neglecting 
the Coulomb force. The latter can manifest itself through the binding of the two 
particles in the neutral positronium system, a lepton analog of the atomic bound 
states already discussed in Chap. 2. We may extend our previous results to obtain 
the decay rates of the para- and orthopositronium s-wave ground states. Since 
annihilation is a slow process the width of these unstable states is very small 
compared to the nonrelativistic binding energy. This justifies a two-step treatment, 
neglecting at first annihilation to compute the binding and then neglecting binding 
to compute annihilation. In a rigorous approach positronium would not appear 
as a stable asymptotic state and would only occur among unstable excitations. 
To zeroth order, the electron and positron are described by a nonrelativistic 
isotropic wave function 


1 
Yi) = Tain eae 


d°q . d°q_ ... 8(na*)'/? 
= iq’t = 1g Nee 
(es en wihia) ie 0 ay 
where a is twice the Bohr radius of atomic hydrogen (since the reduced mass is 
half an electron mass) 


(5-126) 


The meaningful electron or positron momenta are of order 1/a = am/2 «m and 
annihilation can be computed in a first-order approximation as if both particles 
were relatively at rest. The second step is therefore to compute the width of the 
singlet and triplet states for free particles at rest. | 

In Chap. 3 it was established that charge conjugation implied that the singlet 
state decays into an even number of photons, the lowest possible number therefore 
being two. We use our preceding result multiplied by 4, since instead of averaging 
over four polarization states we have here the decay of a given spin state. — 

Furthermore, we have to reconsider the reasoning of Sec. 5-1-1, giving the 
relation between cross section and transition probability per unit time, denoted 
here by tj,,- To obtain the latter we multiply 40 by a normalization factor 
constructed as follows. We substitute for the flux factor 


ie m -1 
m2 (P1 * 2)" _ m*] 1/2 ~ v3 
[ | p2| |p2|0 
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a weight 


1 
= | (0)? =a 


ad .~ sale 
la (q) 
This arises by repeating the calculation leading from Eq. (5-4) to Eq. (5-13) in the 
case at hand. Thus 

" Lee 1 ma 
Ting = 93 lim (4020) = 25 (4nd) = 
v2.70 


We note indeed that the ratio eal 2 where B is the binding energy B = ma?/4, 
is of order «? ~ 107° justifying the above treatment. 

The previous calculation may be interpreted as giving the decay probability 
per unit time in terms of the transition matrix element at threshold as 


t t=) (2n)*6(P? — 2m)d3(Py) |< f|T | ete >|? | WO)? (5-129) 
if 


(5-128) 


This formula enables us to discuss the orthopositronium case where the minimum 
number of emitted photons is three. From the reduction formula to lowest order 


3 
Spi = (—ie)? [as d*x, d*x3 exp (: oy kx) 
i 


s 


x €O| TE: W(x )£rW(xa): 2 W(x2)boW (x2): W(x3)ésW(x3): | |e*e™, triplet> 
(5-130) 
Wick’s theorem yields for the matrix element 


CO: W(x1)61iS(x1 — X2)f2iS(x2 — x3)é3(x3): |e*e”, triplet) 


permutations 
which inserted into (5-130) and after integration gives 
Spi = i(27)*6(PF — 2m)°(Py)TF 5: 


1 
= —ije>(2n)*5(P¢ — 2m)d*(P 54, ———_—____ 
(2n)"9( f J0°( ) ee Ft P- ko — mn (5-131) 


x £2 


1 (B) 

p—k3;—m a 

Here u” and 0 represent the Dirac spinors of the electron and positron at 
rest, and a projection on the triplet state has to be performed. The common 
energy momentum of the fermions is p = (m, 0) while (k,, ¢1), (k2, €2), and (k3, 3) 
are the momenta and polarizations of the final photons with P,; = > ik. Of 
course, 7 is symmetric in the photon variables as required by Bose statistics. 
The integral over phase space will have to be divided by 3!. The result expressed 
in Eq. (5-131) may be represented by the sum of six Feynman diagrams, such 
as the one shown in Fig. 5-6a, obtained by permuting the photon variables. To 
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ky,€, ky, €, k3,€3 


=, P,B 


(2) (b) 


Figure 5-6 Three-photon decay of orthopositronium. (a) One of the six diagrams describing the 
process to lowest order. (b) Choice of axis to measure the photon polarizations; n is the normal to 
the reaction plane. 


proceed with the calculation let us introduce the notation 
FT pe = —e° yfZai237- xt = xtio2 (5-132) 


with y, standing for the two-component spinors describing the fermions. We 
shall illustrate a method different from the one which consists of the computation 
of traces when squaring 7,_;, namely, we shall explicitly obtain the 2 x 2 matrix 
€,23. With obvious notations the latter is derived from a 4 x 4 matrix as follows: 


1 1 1 
4123 = (0, Ai a= m f2 se Al§) 


Set w; = k?, k; = k,/w;. Choose, furthermore, &;*p = 0 for j = 1, 2, 3 and let us 
write 0; for k; x ¢;. This is a unit vector since k;- ¢; = 0. Then 


1 Ki —pitm bik 
oo ——__—, = —(0, 1} ——_ 
(OA peg = A pet = Oe, 
“ 1 fay nee) 
7 ~ 2mo, fa 6 0( 501 — 160) 
= —7— (0°28, ia 4) 
Similarly, 
1 F ie 2(' . 
p—ks—m No} = am \ ares 
Therefore, 


“oc ae eae 
A123 = 7 ale £1, 16 * 63) o°e 0 O° 83 


= ——~ (6°8 0° & 6°83 + 6° 5; 6°82 5° 453) 
4m? 
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The required combination ae 4123 is of the form o°V. Indeed its trace 


vanishes since it is proportional to 


» [(e1, £2, £3) + (6;, &2, 63)| =0 
perm 


Recall that tr ojo jo, = 2ié;,. Thus _— 4,23 = a° V, where 


1 
2m? 


d [erler + #3 — 62° 53) + 5y(e3 ° 52 + €2° 53)] (Sala) 
cycl 

Clearly the lifetime of orthopositronium is independent of its polarization state. 
However, the angular distribution when we observe the photon polarizations 
depends on the initial polarization state. We shall limit ourselves here to the 
total rate which can be computed for an unpolarized orthopositronium by 
averaging over the three spin states. Therefore, 


3 aI 3 
ss ae 1 ha O*S (amy*6(E co; — 2m)SAEK,) 


“inlet 6 | 8a,@2@3(2)° 
6 
x=, FY WYP +|KWP24+/h 7%) (5-134) 
ma photon 


polarizations 


In writing (5-134) we have taken into account the factor io, explicitly written 
in Eq. (5-132). For instance, the decay amplitude of the state |J = 1, J, = 1>= 
5,4) is proportional to the matrix element y1,,i¢20°Vy1 2 = V; + iV, = V., 
while the corresponding amplitude for the state |J = 1, J; = 0> = 1/,/2]4, -3> + 
1/,/2|—4,4 involves 
1 . ; 
V2 (x7 121020 ° Vx1/2 -- 11/2i620 ° Vx—-1)2) = —./2V; => Vo 
The last amplitude for |J = 1, J, = —1> introduces V_ = — V, + iV. Therefore, 
4(| Ve | ata | Vo |? oe | Ve |?) = 4(V2 + V7? + V2) =4V? 


To complete the calculation it is convenient to use complex polarizations such 
that e* = ¢ + ié satisfying 


et? -0 
ese) = 2 


BT 8s = 8° &2 — 51°52 + i(e, ° 62 + 82° 5) 
Thus 


1 
WS = — alps pa Slr Sraar 
a 2, 8 (@2 £3) + 8; (82 * 83) 
4 i 
~ 12m4 


+ [let - ex)(ez + es )(es « e¢) + cc} 


zy? Re 2 {2|ez + 83 |? + 2(et - ef) (ez » 3 )(€3 + 87) 
cyc 
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We now sum over photon polarizations, choosing for each photon two orthogonal 
basis states (a) and (b) with ¢ along the normal to the reaction plane or along 
k x n(see Fig. 5-65): 


(GQ) a b=kxn e+ sa=n+ikxn & =a*=n—ikxn 
(b) e=kxn 6=-—n e* = —ia e- = ia* 
Thus 4 

Z Wat vlecal 


photon polarizations 


Only the first term in the expression for V has contributed to the sum over 
photon polarizations. Moreover, 


a> +a; =(n + ik, x n)-(n+ik; x n) = 1—cos 623 


with cos 023 = k,-k;. The final expression 
: ‘ , 4 
3(V. |? + | Vol? +] V_ |) =$V2 = aoa ¥ (1 — cos 633)" (5-135) 
cycl 


may now be inserted into (5-134) to yield 


ae a'm?*(4n)°a? 41 1 eee 


Triplet = 8x m* 36 (2x)? 


53(¥ k,)6(Z w; — 2m) 
. x > (1 — cos 023) 


cycl 


8010203 


Each term in the cyclic sum contributes equally. Therefore 


6 
Triplet ae i daw, da, (1 — cos 012) 
(5-136) 
1 — cos 6;, = 2m Sea 
172 


and the range of integration is the Mandelstam triangle D:@, + @2 2m, 
0 < @1, @2 < m. Let us write 


6 1 1 eee 
ee 2) Ge a Yd (5-137) 
triplet 31 Fie: 


0 lg 
The integral A gives 


aa 
A= |S p- 24x20 -9Ind-s)= 34203 
o x i 


The last sum )P 1/n? equals 17/6 so that A = (x7 — 9)/3 and we get the final 
value quotéd in Chap. 3: 


= 2a° 
Triplet = MO (x? — 9) (5-138) 


From the above expressions one can easily derive the energy spectrum of the emitted photons. 
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5-2-4 Bremsstrahlung 


In the first chapter we have given the classical radiation rate for the emission 
of photons when a charged particle suffers an abrupt change in velocity. We 
want to give here a more precise treatment, taking as an example Coulomb 
scattering off a fixed nucleus (charge — Ze) discussed in Chap. 2. The matrix 
element of interest is given by Eq. (5-87) if we deal with one photon emission. 
Assuming 


a (5-139) 
D 


where v is the incident electron velocity, we shall perform an expansion in Ze 
as well as e and retain only the leading contribution. Proceeding heuristically, 
the external field is accounted for by replacing free spinors by solutions of the 
Dirac equation in the presence of the external field A°*' such that 


Let therefore y**' denote the solutions of 
(ig — eA™ — mx) = 0 


subject to appropriate boundary conditions. We write the radiation amplitude 
Syi = —ie [atx ef * Wein(x)AWin'(x) (5-140) 


with k? = 0, e = —1, e-k = 0. The incident electron is specified asymptotically 
by its momentum p; and polarization a, and the outgoing one by p; and f. To 
lowest order in A®' we have seen in Chap. 2 that 


Win (x) = Win(x) + [ey SF(x — y) eA™(y)Winly) + 
(5-141) 


Sate) = Wenden) + ey Wourly) eA™(y)SF(y — x) +> 


Here Win, Wour Stand for the corresponding solutions of the free Dirac equation 
Win(x) = 7"? * u(p;, &) 
Wour(x) = elPs , “apr, B) 


where, of course, p? = p7 = m*. When substituting (5-141) in (5-140) we observe 
that the contribution proportional to J d*x e'***\,,,(x)¢Wi,(x) vanishes for k? = 0. 
Therefore to lowest order 


(5-142) 


Sp; = —ie* [ax d*y [e™ * ou(x)AS*(x — y)A™(y)Winly) 
+e Woulx)A™'(x)S*(x — y\eWiny)] (5-143) 
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We note an analogy with the Compton amplitude where the external potential 
A(x) replaces the free photon wave function e~ ™** e(k). Using (5-142) together 
with the Fourier transform of the potential 


Ze d*q 1 
4oxt ) 0) ,-iq:x a 


we readily find 
iZe? 
Spi = pet Bone 2nd(p? — p¥ — k?) 
ih 
Pp t+k—m : 
Only energy conservation has survived since space-translation invariance is 
broken by the presence of the center of force. 


Once more the derivation of the cross section has to be modified with the 
result that 


“ 1 
x u(p,, B) [ oye aaa lees at) (5-145) 


2-6 
=e | 2n5(p} + k° — p?) 


do = 
pr |i 


q 


(5-146) 


[ ] 2 md?p, d*k 
[pp +k — pi|? | 2p7k(2n)° 


where p?|v;|/m =|p;|/m is the flux factor and the square bracket is the one 
occurring in Eq. (5-145). 

To obtain the unpolarized cross section we sum over ¢ and f and average 
over x. To simplify the notations we write q = (py + k — p;) for the transferred 
momentum, let w = k® and Q,, Q, be the directions of the outgoing photon 
and electron respectively. Thus 


Pape Zz 
ee Pa re dado a, (5-147) 
nm |pi| 14 
rer: Paka it o ,obi-~ Kk +m \(") 
Fa 55 ( epee") Om 


(5-148) 
Bei ae 1 -—k-m pr +m 
x ( ° ie Moe, =f po bem\(ien)| 
a 2pr-k 2p;°k 2m 
The quantity F can be decomposed into three parts: 
1 
= Femon 1 + F, + F3) 


ey te [Ae ey oe EE +, + mi) 
(Pr° k) @ 


P=; —Pp,) 


F,= tr {[y°Bi-— K+ mA: + m)y°(Bp + K+ me + m)] + (Di — Py) 


1 
(py kip: k) 
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Choosing e° = 0 we find 
F, = =D Pe: Pr) )2(m? + 2pi op? + 2poe — pit py — pik) + 2e> py €° pi ke py 
Ps 
+ 2pPw k- ps — pik pj k] 
F, = F,(pj«> — py) 
16 ; 0,0 2 
F3 = —————_ ¥ [e- pe* (Pi  — py ke + 2p py — Api ps — 2m’) 
>= toe Bipr bY bee Ps f 
+ (€*pys)?k+ pi — (€° pi)?k+ pp + pit k ppt k — mw 
+ w(@ pit Ps — PP Ps k — py pi" k)] 


Let 6, be the angle between the vectors k and py, 6; the angle between k and 
p;, and ¢ the angle between the planes (k, p,) and (k, p;), as shown on Fig. 5-7. 
The sum over photon polarizations can be performed using 


> (eps)? =|py|? sin? 6, > (epi)? = |p;|? sin? 6; 
(2° py)(e* Pi) = |r| |p:| sin 4, sin 8; cos @ 
For the final formula we shail use the notations 


pPr= Es pP=E; |ppl=pr tele ial? =? (5-149) 


The differential cross section as found by Bethe and Heitler in 1934 can now be 
written as 


we be ae dw 
— dQ: 
~ Qn) pig* @ 
py sin* Os 2 2 pr sin? 6; 
= GP ns A) 4E; = 2 202 : 
: le — py cos 8 a | a) pce ee (5-150) 
Fae pi sin? 0; + p7 sin? 0, 4 ___Psbi Sin 6; sin 05 cos @ 


(Ey — py cos OE; — pisin 0) ~ (E> — pr cos 0,)(E; — p; cos 0) 


x (4E;E, — q? + 20°] 


” Figure 5-7 Kinematics of the bremsstrahlung process. 
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This result has a rather complicated structure exhibiting the by now familiar 
catastrophic (!) behavior when w — 0. 


In the limit «w — 0, Eqs. (5-147) and (5-148) may be written approximately 


do do a*k g- e-p,\2 
snl) ely- ey ou 
dQ. 440 \dQ. Maite 202m) k* py ke pj 


as claimed in Chap. |, where the results on the integrated cross section were already given. 

In this example one realizes that for k small, practical observation does not allow us to 
distinguish between the elastic cross section including correction to order e? and the inelastic one 
treated to the same order. We can therefore only expect that the sum of these two has a finite 
limit to this same order. This will be discussed further in Chap. 7. 


5-3 UNITARITY AND CAUSALITY 


The reduction formulas have related the measurable on-shell scattering amplitudes 
to the general Green functions. Most of the developments in the remainder of 
this book will be devoted to the computation of these functions in the framework 
of a local relativistic dynamics described in terms of a lagrangian. Nevertheless, 
it is important to stress and abstract the important physical requirements such as 
unitarity and locality common to the various dynamical systems and possibly 
more general than the present-day applications. One of the main outcomes of 
such an analysis is the proof of a specific analytic structure of amplitudes leading 
to dispersion relations in one or several variables, complemented by rules providing 
the boundary values along the various discontinuities. It is, however, not our 
purpose to investigate these developments in detail except to assert their im- 
portance and quote some significant results. In the next chapter we shall see how 
the analytic properties are reflected in the perturbative expressions. 


5-3-1 Unitarity and Partial Wave Decomposition 


Unitarity of the S matrix reflects the fundamental principle of probability con- 
servation. Even though we have to introduce in certain instances the artificial 
device of an indefinite metric in Hilbert space, the physical quantities always 
refer to states with positive norm, preserved through the time evolution. Formal 
developments should not obscure this important fact, reflected in a number of 
relations, the prototype of which is the optical theorem of Bohr, Peierls, and 
Placzek. . 

Let us therefore look at the restrictions implied by the unitarity of the S matrix 
written as 


StS =I1=14+iT-—T')+T'T (5-152) 
in accordance with Eq. (5-6). Inserting momentum conservation 


Cf|T | = (22)*64(Ps — Pd) T 5: (5-153) 


242 QUANTUM FIELD THEORY 


we obtain the nonlinear relation among amplitudes: 


(Fy — Tj) = iY) (2m)*5*(P, — PT is Fni (5-154) 


involving a sum over all possible intermediate states |n> coupled to |i> and | f>. 
In practical cases the scattering process is initiated by a two-body state. Let us 
concentrate on two-body elastic scattering. Generalizing this definition we may 
assume that the particles exchange spin or internal quantum numbers, in agreement 
with the corresponding conservation laws, provided they remain within the same 
symmetry multiplet (see Chap. 11). For simplicity we ignore the case of identical 
particles. First choose |f> = |i> in Eq. (5-154). This corresponds to forward 
scattering with spin and internal variables equal in the initial and final con- 
figuration. We have to assume here the absence of long-range forces. The left-hand 
side of Eq. (5-154) reduces then to 2i Im 7;;. The right-hand side is related to 
the total cross section o,,,(i) up to a flux factor contributed by the initial state. 
To be specific, denote by (m,, S,) and (ms, S,) the masses and spins of the initial 
particles. For a given spin state i and total center of mass energy squared s we 
derive, from Eq. (5-13), 


1 


Stor(i) = ota sane ae) Win oP, — Pyar. (5-135) 
where A(X1, X25 X3) = (x7 ae xe ar x3) oe Dak, a 2X2X3 aa 2x3X1 (5-155a) 


and we have used in (5-155) a normalization of states appropriate to an invariant 
phase space element d°p/2E(2z)>. The common center of mass three-momentum 
|p| of particles A and B is related to s through 4sp” = A(s, m2, mg). Using (5-155) 
we find the optical theorem in the form 


Im F;; = A'!7(s, m2, mé)o,o,(i) (5-156) 


The amplitude .7;; enters in the expression of the forward elastic cross section. 
Assume the polarizations to be such that the initial state is invariant under rotations 
around the incident momentum. The elastic cross section can then be integrated 
over the azimuthal angle and expressed in terms of the momentum transfer t, 
instead of the cosine of the scattering angle. In the process A+ B— A+ B call 
(Pa, Pp) and (pa, pp) respectively the initial and final momenta satisfying energy 


momentum conservation pg + Py = Pa + Pp. The Mandelstam variables are defined 
as 


Ss = (p. + ps)? = (pa + Ps)” 
t =(Pa— Pa) =(Pe— Pi)? Ss S++ u = 2(m?2 + m3) (5-157) 
4 = (Da Pale — (Pt — Pa) 

The differential elastic cross section can be written as 


|F(s,? 
167A(s, m2, mz) 


do. 
dt 


(s, t) = (5-158) 
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Forward scattering is characterized by t = 0 and 7(s,0) is what we denoted as 
J; above. Consequently, 


dG. _ 1 [ ReA/ i. | el) 
at T6n Leary tom 60 iat 


In very high-energy collisions it may happen that the imaginary part of the forward- 
scattering amplitude dominates over the real one. In this case Eq. (5-159) may 
serve to normalize the differential elastic cross section, assuming the total cross 
section to be known. 

It is possible to extract further consequences from the unitarity condition if 
we succeed in diagonalizing the 7 operator, at least partially. For two-body 
scattering this is achieved by using as a basis the eigenstates of the total angular 
momentum. 

The helicity formalism of Jacob and Wick is best suited to perform the 
corresponding projection. Call A, and A, (Aj, A) the helicities of the initial (final) 
particles in the center of mass frame where p,+ p, = pz + p, = 0. For a two- 
body state in this frame let 6 and @ be the polar angles of the relative three- 
momentum p with respect to a fixed system of axis. Consider Ro», the product 
of a rotation of angle @ around the y axis followed by a rotation of angle @ 
around the 2 axis. This rotation transforms the unit vector Zz into the vector 
p/|p|. A state of total angular momentum J with projection M along 2 is obtained 
as 


pre ONE , 
J,M3 Ag, As> = ( - [a0 sin 0 dO D1 _1,m(Rad)|Po 4a; Pb, 4» (5-160) 


Helicity is defined as the component of spin along the three-momentum, thereby generalizing the 
case of Dirac particles, and the notion extends to massless particles. 

We recall that the Wigner Y functions are obtained from the representations of the SU(2) 
group of special unitary 2 x 2 matrices as follows. Any rotation R acting on an arbitrary vector 
p can be represented by a pair U, —U through Us: pU't = o:(Rp). If we consider more generally 


S 


at ae See 
an arbitrary 2 x 2 complex matrix A = ( ) it may be identified as a linear operator on homo- 
: c 


d 
geneous polynomials in two variables u,,2,u-1)2. To obtain %/, »’, we extend this action on polynomials 
. . * Py ; es i+ j— 

of degree 2j, a basis of which is ty, = [(j + m)!(j — m)!]~*? ui"u~ 172. If 
u4)2 = duyj2 + bu_1)2 
ue 1/2 = Cuy)/2 + du_ 1/2 
Um = Dy (AYU jm’ 
then 
q’' b™ c d™ 


aa inn 
nytny=jtmnatns=j—m n,! nz! n3! Ng: 


nyt+n3=jtm' nztng=j—m" 


Dj, mA) = [UG + m1 — my! + mG — mt Paley 


These matrices enjoy the properties 
DHAyAz) = DUA\)D{A2) DAT) = BUA)" 
Di A*) = D4(A)* Di AAD) = A745 ine’ (5-161b) 


Dim (ei???) = ent oe ym’ 
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In particular if A is unitary, so is (A). Returning to the group SU(2) the notation in Eq. (5-160) 
identifies the rotations Ro» with its two-by-two representative in such a way that 
DE _ a, u(Rod) = Bits, -2,(9, 0,0) = eM dhe a, 3,(8) 


(S-161c) 
dn a(8) = (— AY" aE uy, —my(O) = (= 1 dina (B) 

Phase conventions may be chosen in such a way that if 7, and n, stand for 

the intrinsic parities of particles A and B the transformation laws of these states 


under parity P and time reversal F readt 
P\J, M3 Aas Ay> = Nate(— 1)? 5:7 56 |J, M; — Aa, — Av? 
ely. M; Aas Ay? =e (= I al aM ; Aas Ay? 


These relations reflect the fact that helicity is odd under parity and even under 
time reversal. 

Using the Wigner-Eckart theorem we may then write for the matrix element 
Z,; the following expansion taking rotational invariance into account 


Zep <P,, 4a; Dos Ao Par Aas Des An? 


, (5-163) 
= l6x) (23 + NF 4. a(S) PL a,x, as 9, 9) 
J 
A kinematical factor arising from the relation 
dpa d*pp 44 Phd mi, m}) 
ae , iP) eee 
| On OE) On QE) O") ° Wet — P=—go ee Ae 


will appear in the expression of unitarity given below. In Eq. (5-163) the 2 axis is 
chosen along the incident momentum p, and 9, ¢ are the polar angles of pj. Finally, 
the sum over J runs over integer or half-integer values according to whether an 
even or odd number of half-integer spins is present in the initial or final state. The 
Jacob and Wick expansion (5-163) is easily generalized to an arbitrary two-by-two 
scattering process. 

If invariance under parity applies 


FAs hedsl) = Jt U,-4:— A —- al) (5-164) 
Similarly, if time-reversal invariance holds 
F >. sda) om Fo ty:t,aS) (5-165) 


expressing the symmetry of the scattering matrix. 

When the total energy s‘/? is below the inelastic threshold only the initial 
two-body channel contributes to the sum over intermediate states in Eq. (5-154). 
Projecting on angular momentum J results in the relation 


Fs) — F7"(s) = 2iA"*(s, m2, mz)s- 17 "*(s)F“(s) (5-166) 


+ The reader will not confuse the time-reversal operator Z with the scattering amplitude used 
throughout this section. 
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where 7 /(s) is considered as a (2S, + 1)(2S, + 1) x (2S, + 1)(2S, + 1) matrix 
in helicity space. Time-reversal invariance may be used to replace the left-hand 
side by 2i Im 7“(s). 

If the particles are spinless the formulas simplify considerably. In particular, 
BSo(o, 6,0) is nothing but the Legendre polynomial P,(cos 0), while Eq. (5-166) 
is solved by introducing a phase shift 6,(s) through 


QArl2 


— Fs) = —i(e79 — 1) = 2e'% sin 54(s) (5-167) 


If the matrix 7 “(s) is diagonal in some appropriate basis the same expression is 
true for each diagonal element. 


Such is the case for instance for pion-nucleon scattering. The two independent amplitudes 7{,2,1)/2 
and J 7 1/21). may be replaced by 


J J J 
F+=Fiya32 + F-12112 


corresponding to scattering in a state of given parity equal to +(—1)’*#/. 
Let us collect the expressions valid for elastic spinless scattering. With q standing for the center 
of mass momentum, 0 the scattering angle, and 6; the /th partial wave phase shift (possibly complex), 


do a l 
dt  64nsq? 


IF P 


F = 16x, (21 + 1); Pi(cos 6) 
0 


1,2 


GF) = — eins, (5-168) 
2q 


1 82 = 
cree ges! = = oa Cen: 


1/2 


s 
IF Ps Img, 


The last inequality turns into an equality below the inelastic threshold. 


5-3-2 Causality and Analyticity 


The macroscopic requirement that effects become manifest after the action of 
causes not only requires that causes be identifiable without ambiguity but also 
implies the concept of thermodynamic irreversibility, a phenomenon outside the 
realm of microscopic physics. Fortunately, the finite velocity of signals allows a 
weaker formulation which does not imply the choice of a privileged direction for 
the time arrow. We adhere to the view that space-like separated regions do not 
influence each other. Stated differently, local observables pertaining to such regions 
commute. We have seen that this can be further specialized to the commutation 
(or anticommutation) of the fundamental fields. For simplicity let us only discuss 
here the case of Bose fields. Mathematically the corresponding properties of field 
commutators translate into analytic properties of their Fourier transforms. This 
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observation is the basis of the original treatment by Kramers and Kronig of the 
diffractive index of light in a medium relating dispersion and absorption. Hence 
the name “dispersion relations” for the analytic representations of the scattering 
amplitudes. 

Anexample will illustrate these ideas. Consider the elastic scattering of particle 
A (mass m,) on a target particle B (mass m,). Both particles are assumed spinless 
to avoid cumbersome technical details without changing the essential conclusions, 
but may carry a charge. We therefore distinguish particle A from antiparticle A 
and use a complex field g to describe both of them. If gq; and q2 denote the 
initial and final momenta of particle A and p, and p2 the corresponding ones 
of particle B, the connected part of the scattering amplitude may be expressed as 


Spi= | ats d*y eMerrmar (1), + mally -pameSp2 |T@ Wop () 


We have absorbed a factor Z~'/* into the definition of the field. With gq, 
and q> inside the forward light cone, the time-ordered product in the right-hand 
side of Eq. (5-169) may be replaced by a retarded commutator 


Te" (y) p(x) > A(y° — x°)[9'), eC] 


without affecting the value of S,;;. This can be seen from the original derivation 
given in Sec. 5-1-3. Define the, source j(x) of the field y(x) through 


(Cl + ma)e(x) = i(x) (5-170) 


and assume for simplicity that @ and j commute at equal times. Taking translation 
invariance into account we may write 


Spi = (20)*6*(p2 + G2 — Di — Gi)iT 


JF =i [a ef: “<p ot2] (3). i(- 5) |p.» (5-171) 


q = 3(q1 + 42) 


Lorentz invariance implies that 7 depends only on the scalar products among 
the momenta, i.e., on two out of the three Mandelstam variables for on-shell 
particles. 

From locality, the retarded commutator ¢p2|6(z°)[j"(z/2), j(—z/2)] |p1> 
vanishes unless z* > 0, zo > 0. Inspection of Eq. (5-171) reveals that Z is an 
analytic function of the four-vector q in the so-called forward tube defined by the 
condition that Im q bea positive time-like vector. This follows from the assumption 
that the matrix elements of the fields are tempered (ie., polynomially bounded) 
distributions. Indeed, ifg = qr + ig:, the exponential in (5-171) provides a damping 
factor e”*°4: when both z and q; are positive time-like vectors. 

This example shows the direct relationship between the local properties of 
relativistic field theories and the analyticity of Green functions. 

Before we analyze the mathematical consequences of this result let us recall 
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the property of crossing symmetry. Instead of the process A(q1) + B(p,) > 
A(q2) + B(p2) suppose that we were studying the reaction A(G:) +#B(p,) = 
A(q2) + B(p2) involving the scattering of antiparticle A on the same target B. The 
corresponding amplitude 7 is given by 


F =i {ats ala) a9 (3) (-4)| [pi 


q = 2(91 + G2) 


Changing the integration variable z into — z, this can also be written as 


a i fas BOs) — (=a (5). i(- 5) |p.> (5-172) 


This amplitude differs from 7 given by Eq. (5-171) in two respects. The momentum 
q = 3(q1 + q2) has been replaced by —G =4(—@2 — q,) and the retarded com- 
mutator by the advanced one 


<p a2 (5). i(- 5) Ip1> > <pa| - a2 (3). i(- 5) [p> 


We may study the quantities 7 and J for arbitrary values of their arguments, 
instead of dealing with the actual physical processes where gq and @ lie in the 
forward light cone. Let us show that 


TF (q) — 7 @|\q--4 = iC(Q) 


vanishes in a certain domain. This is the Fourier transform of the commutator 


C(q@) = [ate e'*"4¢p,| |" (3), i(- 3| |pi> 


= (2n)* ¥ <po| j*(0) [n> <nlj0)|p1>54 (a i = = P| (5-173) 


\yy 


~ (2n)* Sal 0) n>"). 29*( 4 — PEP? +p, 


If the vacuum is an isolated point in the spectrum, meaning the absence of 
massless particles, the states |n> contributing to each of the above sums will 
be such that there exists a lowest positive value p? in each term (a vacuum 
contribution is excluded by the connectedness hypothesis). Consequently, C(q) 
will vanish outside a region bounded by the two hyperboloid sheets depicted 
in Fig. 5-8. One of them corresponds to a mass M,, the other to M_, and 
they are centered respectively at —4(p; + p2) and 3(p; + p2). We conclude that 
the amplitudes 7 (p2, q2; pi, qi) and J (p2, — 41; pi, — 2) coimcide at unphysical 
points corresponding to the unshaded region of Fig. 5-8. This is the property of 
crossing symmetry, which relates processes where a particle appearing on one 
side of a reaction is replaced by an antiparticle of opposite (therefore unphysical) 
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MM) 


// (Pp, + P2)/2 
sa 

Iql 
‘ V 


, +P2)/2 
¥ Figure 5-8 The two hyperboloid 
y sheets limiting the support of the 
Wy commutator in momentum space. 


The unshaded area is the coinci- 
dence region denoted @. 


four-momentum on the other side of the reaction. The statement of crossing 
symmetry will become meaningful if we succeed in showing that analytic con- 
tinuation allows access to the coincidence region starting from physical values 
for the momenta. This property is a remarkable consequence of field theory. 

We denote 7(q) the expression given in (5-172) with q substituted for 
— Gand observe that 7 (q) is analytic 1 in the backward tube, 1.e., for Im q a negative 
time-like vector. 

Thus 7(q) and Z(q) have a priori disjoint domains of analyticity and 
coincide on a real domain @. The celebrated edge of the wedge theorem, due 
to Bremermann, Oehme, and Taylor, allows us to conclude that 7 and F are 
analytic continuations of each other, and moreover that their common domain 
of analyticity is larger than the union of the forward and backward tubes and @. 


If we were dealing with functions of one complex variable only, the problem would be easily settled. 
Indeed, if f *(z) are analytic in the upper and lower complex z plane respectively and coincide on a 
segment of the real axis, a simple application of Cauchy’s theorem proves that they are branches 
of the same analytic function. The coincidence points are in fact analyticity points. The present 
situation is obviously more complicated. Around each coincidence point there are holes corresponding 
to space-like imaginary directions where neither 7 nor 7 are defined. Hence the pictorial name 
“edge of the wedge.” Moreover, analysis in several complex variables uncovers new properties 
without equivalence in the case of one complex variable. Such is the notion of a holomorphy 
envelope. A function of several complex variables analytic in a domain @ can at least be extended 
to a domain 9 > J, with the property that through any of its boundary points we can find an 
analytic manifold, i.¢., the set of zeros of an analytic function, lying entirely outside Z except for 
the boundary point in question. This property is referred to as pseudoconvexity (in analogy with 
ordinary convexity), with analytic manifolds replacing planes. In the case at hand, even though a 
purely geometric construction of @ is possible, an interesting representation generalizing the Kadllen- 
Lehmann representation for the vacuum expectation value of the commutator is available to solve 
the problem. 

The reader may wonder whether the original analyticity domain, including the forward and 
backward tubes, was not enough in simple cases such as forward scattering with p, = p2 = p, 
41 = 42 = q. The following remark will dissipate doubts. Let a complex vector gq lie on the mass 
shell and write q = gg + ig;. Then the condition mz = q? = qk — q? + 2iqg’q, states that gg and q; 
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are orthogonal and that it is impossible for q; to be time-like. If it were so then from the ortho- 
gonality condition qx would be space-like, g2 < 0, and q} — qi would be negative. Thus only the q 
with negative or complex square mass belong to the tube. For such unphysical values a dispersion 
relation can be written at once, but we are left with the task of finding a suitable analytic 
continuation in q? in order to obtain some information about the physical process. 


5-3-3 The Jost-Lehmann-Dyson Representation 


This is a representation for the matrix element of the commutator C(q) given 
in Eq. (5-173) consistent with the spectrum properties in momentum space and 
the support properties in configuration space. Recall the expression for the free- 
field commutator A(x, 7) [Eqs. (3-55) and (3-56)] which vanishes outside the light 
cone and has an odd Fourier transform concentrated on the hyperboloid of 
mass yi”. The Jost-Lehmann-Dyson representation gives C(q) as a superposition 
in configuration space or a convolution in momentum space of such free-field 
commutators in the form 


Cig= | a*k du* e(q° — k°)8[(q — k)? — wp} p(k, u) (5-174) 
S 


The weight p is nonvanishing only for those values of k and yp? such that the 
hyperboloid (q — k)? = p? does not intersect the region @, in which case it is called 
admissible. Such hyperboloids will have a center k lying in a bounded domain 
equal to the intersection of the future light cone centered at —[(p; + p2)/2] and 
of the past light cone centered at (p, + p2)/2. The lowest limit on the p integration 


is then 
° + : 
bag = max {0 Ms ~ (M52 +k). (EP x) } 


It is clear that C(q) given by (5-174) has all the required properties. Dyson has 
shown that the converse is also true, namely, that C(q) necessarily admits a 
representation of this form. We refer the interested reader to the articles quoted 
in the notes. 

To derive the corresponding representation for the amplitude, we simply have 
to compare Eqs. (5-171) and (5-173). Up to a factor i, the difference lies in the 
presence of a step function multiplying the commutator in configuration space. 
We have to be careful about this multiplication of distributions which might leave 
some undefined contribution concentrated at the origin in z space. Ignoring this 
complication for the time being, we obtain in momentum space a convolution 
which yields 


aoe | ged AG) (5-175) 


an js qQ—kP- 

The physical amplitude is recovered by taking the limit of real positive time- 
like values of g, with a vanishingly smal] imaginary part in the forward light 
cone. 
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It might happen that the integral over yz? diverges for large values, a reflection 
in momentum space of the poor definition of the retarded commutator. However, 
subtractions could be performed in the form 


1 p(k, uw?) — [(q— ky + wo | 
(= -z(, dea? PRP) | SE | +P 6176 


with n an integer and P(g) a polynomial in q, without altering the analytic 
properties following from Eq. (5-175). We shall therefore omit writing down these 
subtractions explicitly. 

The representation (5-175) exhibits 7(q) as an analytic function for all real 
and complex points which do not lie on admissible hyperboloids. 


This analyticity domain J, derived from the Jost-Lehmann-Dyson representation, is hard to visualize. 
We borrow from the work of Bros, Froissart, Omnés, and Stora the following helpful geometrical 
construction. The trick is to introduce a fifth coordinate z and map the original complex four- 
dimensional gq space on the quadratic surface P:z = q?. In this way the denominator in Eq. (5-175) 
is replaced by the linear expression (in z and q) z — 2q°k + k? — yx? and the admissible hyperboloids 
by the corresponding planes with (k, u?) belonging to S. We then associate to each complex point 
(z,q) a complex straight line L(z,q) described parametrically by Rez +AImz, Reqg+Almdq. If 
(z, g) belongs to P so does its conjugate, so that L(z,q) intersects P at two points (corresponding 
to A = +i). From the properties of quadratic surfaces it follows that the real points of L(z, q) (obtained 
for 2 real) cannot be on P, so that on a real picture L(z, q) lies entirely above or entirely below P. 
[A special case would be that L(z, gis entirely contained in P.] 

The real coincidence region is mapped into P in a region limited by the two planes 
[(pi + p2)/2]? + q:(pi + p2) + z < M3. and the admissible planes do not cross this coincidence region 
@. From the fact that py? > 0 it follows that they are above a plane tangent to P. Since we are 
dealing with planes, this defines an excluded region, the convex hull of @ (call it ¢), which cannot 
be intersected by an admissible plane (Fig. 5-9). We now have a description of the analyticity domain 
as follows. A complex point (z, q) belongs to it under one of the following circumstances: 


(a) L(z, g) lies entirely below P. This case corresponds to the original tubes. 
(b) L(z, q) intersects %. Indeed if (z, q) would belong to an admissible plane, the same would be true 


for its conjugate. From linearity L(z,q) would then belong to the admissible plane and would 
not intersect @. 


5-3-4 Forward Dispersion Relations 


We are now in a position to examine the elastic forward amplitude when 
41 =42=4, Pi=p2=Pp, as a function of incoming energy v = q: p/m = 
(s — mz — mé)/2my. We wish to show that it is analytic in a cut plane, with the 
cuts running along parts of the real axis. This is a nontrivial result in view of 
the remark at the end of Sec. 5-3-2. Of course, we require that q? = m2, the 
physical mass-shell condition. 


This is obtained by intersecting the surface P, from the previous construction, by the plane z = m2, 
leading to a hyperboloid. We have to study the corresponding intersection of ¢. For a complex q 
lying on the mass shell, the real line L(m@,q) does not intersect this hyperboloid with which it has 
already the two complex points (mz, q) and (m2, q*) in common. We have noticed that for fixed p 
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Figure 5-9 The region % (shaded area), convex hull of the 
coincidence region in five-dimensional space. 


Lorentz invariance implies that 7 is only a function of the component of g along p. Taking the 
latter along the time axis means that we can restrict our attention to vectors q with only two non- 
vanishing components, gy and q,, say. The problem is in effect two dimensional with a third 
component = added for convenience. The complex point (m2, q) on the hyperbola g3 — q? = m? will 
have (Im q)? < 0 and will be a point of analyticity provided L(m2, q) intersects the convex hull @ in 
the (qo. qi) plane. The condition for this is that 6 meets both branches of the mass-shell hyperbola. 

Now @ is given by (p + gq)’ < M% or, equivalently, m? + 2p-q + 2 < M%. The condition that 
meets the upper branch is mj + 2mam, + m2 < M2 or (ma +m)? < M3. Similarly, the condition 
that it meets the lower branch is (m, + m,)? < M2. These are therefore the criteria that eliminate 
the occurrence of complex singularities in the complex vy plane. On the other hand, we know that 
M2 <(m, + m,)*, corresponding to intermediate states A + B or A+ B in the sums of Eq. (5-173). 
From this we conclude that forward dispersion relations (without complex singularities) can at best 
be proved marginally using the above method. 

It may happen that M. or M ., or both, are strictly smaller than the elastic threshold. One 
case frequently encountered can be disposed of without too much trouble. This is when an inter- 
mediate isolated state [mass Mj < (m, + mp)? | occurs below the threshold. Its contribution is then a 
pole in the energy variable. Multiplying the amplitude by (p + q)? — Mé does not modify the analytic 
properties but eliminates the singularity. In pion-nucleon scattering, for instance, the nucleon inter- 
mediate state contributes such a pole. 


Let us assume that apart from such poles the condition M2 > (m, + m)* 
is satisfied. We then conclude that in the case of forward scattering the following 
properties hold: 


1. The amplitude is analytic in the complex v plane except for cuts extending 
along the real axis from m, to +00 and from —oco to —m, and possible poles 
in between. 

2. The amplitude is real in between the cuts so that its discontinuity across the 
cuts is purely imaginary. This follows from Eq. (5-173) applied to forward 
scattering. 

3. The amplitude is bounded by a polynomial for large v. This stems from the 
tempered character assumed for the fields and holds a fortiori for complex v. 

4. Finally, if A = A crossing symmetry requires that 7 (—v*) = 7 (v)*. 


Using this information and Cauchy’s formula, a simple analytic representation 
is obtained. For simplicity assume that, as in property 4, A= A and that 7 
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vanishes at infinity. Then we can write 


fo @) 


dv’ Im Tw ) (5-177) 


1 
Vm ae = 
70) = potes + 2 | Ta eee 


(m, +m)? 
If Z(v) would grow like a power of v a similar representation would hold for a 
function obtained by dividing v by a real polynomial in v. 

We must emphasize that only physically measurable quantities enter the 
above representation. The absorptive part Im -7(v) is related to the total cross 
section through the optical theorem, Eq. (5-156), while the possible pole terms 
have as residues products of coupling constants. Indeed, formulas of the type 
(5-177) enable us to measure such coupling constants. 


As an illustration, consider the time-honored example of pion-nucleon scattering. For arbitrary charge 
we write the two independent amplitudes in the form 


TF = i(p2)[A + igi + G2)BJu(p:) (5-178) 


with u(p2), u(p;) the on-shell Dirac spinors describing the final (momentum p;) and initial (momentum 
p1) nucleons. The incoming pion has momentum gq; and the final one q2. The kinematical invariants 
are 


s=W*=(,4+m) 
t = (41 — 42)” = —2q"(1 — cos 8) (5-179) 
u = (py — q2)? = 2M? + 2y?—s—t 
Here M and w are respectively the nucleon and pion mass, 6 is the center of mass scattering angle, 
q is the pion three-momentum in this frame, and W is the total center of mass energy in such a 
way that 
p= tM? = LW = MP = 27) 


re (5-180) 


The differential cross section is, according to Sec. 5-1-1, 


do) of Meas 

dQ a 7 oe 
and in (5-178) the functions A and B only depend on the invariants. Anticipating the discussion of 
symmetries (Chap. 11) we know that strong interactions are isospin invariant and that the nucleon 
has isospin 3 and the pion isospin 1. Therefore all the channels are described in terms of two 
reduced matrix elements corresponding to total isospin + or 3, that is, 4 and B are combinations 
of A’, A® and B’, B? respectively for any process with definite charges. This gives the following table: 


S fe 
HPSe PA gs. ott) _ gt-) 
tTno-wt n 


apn ig? 27a FET (5-182) 


nnn 


The last is called a charge exchange reaction. Since only two reduced amplitudes describe three 
measurable processes, this implies triangular inequalities among cross sections. It is convenient to 
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introduce the combinations 


F'*\qy, 42) = +T*— go, — 41) (5-183) 


respectively even or odd under s++u exchange. From the explicit structure of Eq. (5-178) and using 
the variable 


(5-184) 


odd under crossing (note that for r= 0 it coincides with the laboratory incoming pion energy), it 
follows that 


A‘*)(—y, t)= + Ay, 1) 


(5-185) 
BY*)(—y, t) = F By, 2) 


At t=0 the amplitudes are analytic (in v) in a plane cut along the real axis from the elastic 
threshold [s =(M + u)?,v =p] to infinity (the s cut) and from minus infinity to the u channel 
threshold [u = (M + y)?, v= —y]. 

Furthermore, isolated poles occur along the real axis corresponding to the nucleon intermediate 
state in the s and u channels. We may write the effective pseudoscalar pion-nucleon interaction 


Ln = igennWNataysW (5-186) 


with t, being the nucleon isospin matrices and z, (a = 1,2,3) the three hermitian components of 
the pion field. Thus the pole contributions are 


Bit@it+M 


= as —¢,+M 
F pole = Jann(P2)T stays —z-——. ———._ ysu(p1) + Gewnil(Da)tat ps eee su(Pi) (5-187) 
w? + 2pieds — 2p 


Now 2p; °qi = 2Mv — 1/2; we may use the Dirac equation satisfied by on-shell spinors to write 


ae z (41 + G2)/2 fh + g2)/2 
T pole = a —* cipal reer ta 2 E ape He, 25q) FO hap) 
Vp-v pty 
(5-188) 
2 
ry LB t 
"»= 9M 4M 
We recognize that the poles only contribute to the B amplitudes with 
BY) = Gann ( 1 = 1 ) 
2M = +yv 
ia (5-189) 


= Gann 1 1 
Be) = 
2M (— 7 Wo ae .) 
Note that the location of the poles is t dependent. 


Inserting the value of v, at t= 0 and ignoring for the moment the question of subtractions, 
we find the forward dispersion relations in the form 


A*y, 0) = | av Im A" 0) ae —) 
; r ae ee yoy 


(5-190) 
nnn ee. sd. i _ ee al 
B®) Ee 0 + 
ih tin (1g 1) 1 —- 5 y—y w+y 
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In fact analyticity in s may be proved for fixed (negative) ¢ in a finite range —tw Sts 0, so that 
(5-190) may be generalized to 


[2] 


1 1 
A'*\y, 1) = - dy Im A™)(y’, a(, +— ) 
Te ) atta vy—-y w+y 


2 1 he " Linge) 
Bey, ) ae u —-F )+ ‘| dv’ Im BU", a, at =] 
2M \v,—-v vty) tt Jusyam VV ay 


These relations may be used in conjunction with a partial wave analysis along the lines of Sec. 5-3-1. 
For this purpose, we rewrite the quantity (M/4nW)Z appearing in the cross section (5-181) as 


(5-191) 


las a qi) 


M 
— (po, A2)[A + 31 + G2)BJu(pi. A1) = fal] f+ fs balan (5=1192) 


4nw 


where A, and A; are the initial and final nucleon helicities and 7, 7’ Pauli spinors in such a way 
that [compare with (2-37)] 


p+M Ge 
u(p, A) = SSS ) 
/2M(M + E) 


With E denoting the nucleon center of mass energy this yields 


E+M 
f= ye UW — MA) 
(5-193) 
E— 
fr= enw ie A+(W + M)B] 
The 2 x 2 matrix appearing in (5-192) admits an angular decomposition of the following form: 
1 : 
Faas = 7 +4) fir, 2%,2.(9, 9, 0) (5-194) 
J 


where J runs over half-integer values. 
Invariance under parity yields fi2,1;2 = f21)2,-1/2 and fij2,-1/2 = f21/2,1/2. Time reversal does 
not give any new information. Using the spinors of definite helicities we have, at = 0, 


fuse =f-12,-12 =(h tf Oud = (4 +h) cos 5 


, (5-195) 
fij2,-12 = —f-112.12 =(hh — fr) x(-4) = (fi — f2) sin 5 
As was already mentioned, parity diagonalizes the matrix f;, ,, in such a way that 
f£ = ftlasja tf 1)2,1/2 = 2+ sin 65,4 (5-196) 


The phase shifts 6,,. are real below the inelastic threshold and correspond to the scattering in the 
channel of total angular momentum J and parity +(—1)’**/*. In nonrelativistic notation these states 
would be obtained from well-defined orbital angular momentum / with J =!+4 so that the 
corresponding phase shifts are also traditionally denoted 6, and 6,,,,- respectively. Let us now 
use the explicit formula for the Wigner 2 functions: 


(e) 
(J + 4)dt)2,1;2 = cos 5 (Po+1/2 — P5-1)2) 


(5-197) 
_ 
(J + 9)d2 1)2,1)2 = a (Po+1)2 + P5-1)2) 
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With the I notation (5,4 = 6),+, 5),- = 614;,-) this yields the following expansions 


fi(cos 6) = ee P}+,(cos 6) fi,+ — Pi_1(cos 6) fi, - 
l= 


fi{cos 0) = py Pi(cos 6)(fi,+ —fi,-) (5-198) 
fie = e's sin 61,4 
q 


Of course, for ! = 0 the only surviving amplitude is fo, ,. The question of subtractions in the dispersion 
relations should now be investigated. We postpone some general statements until the next subsection 
except to say that in pion-nucleon scattering it can be shown that the only amplitude requiring 
a subtraction is A‘*?. 

A classical application of Eqs. (5-190) is to the determination of the effective coupling constant. 
The principle of the method is to evaluate accurately from low-energy phase shift analysis the B 
amplitude for x‘ -proton elastic scattering (it is, in fact, dominated by the J = 3, T =} resonance) 
and to compare its real part in the forward direction with the value given by the dispersion integral. 
The latter is also dominated by its low-energy contribution so that the approach is consistent. The 
value quoted by Hamilton and Woolcock is 


prank (94) _ 0.081 + 0.002 (5-199) 
4n\ 2M JO 

Dispersion integrals play a leading role in disentangling the various phase shifts at intermediate 
energies. As an exercise we suggest extending the formalism to the t-channel reaction nm > NN. 


5-3-5 Momentum Transfer Analyticity 


In the sequel, we disregard the spin of the external particles. Even though it is not a straightforward 
matter to include it, this can be done without modifying the results. Instead of using as a starting 
point of our discussion a reduction formula involving a particle in both the initial and final states, 
as we did in Sec. 5-3-2, let us rather reduce two initial particles in an elastic process with momenta 
p, and q, and associated fields wy and g. Thus we would initially write the connected S-matrix element 


<Pas 42|S [pin qude = = dx* d*ye Merx*ar-y) (C), + mé)(Oy + m2)<p2, gal TH(x) ey) [0>- (5-200) 


Returning again to the derivation of this formula, we observe that we might as well have used a 
retarded commutator instead of a chronological product in the physical region of the elastic process. 
Dropping inessential contact terms, we may therefore consider the replacement (with j, and jy the 
sources of the w and ¢@ fields) 


(Cx + me(D, + m2) <p, g2| THX) P(y) [0 > <2, G2] O(x° — y°) Liv), jo(Y)] [0% (5-201) 
Taking momentum conservation into account, we obtain 


€P2s G2| S|p1. 41 < = (2)* 6*(p2 + qo — pi — Gu)iT a1 


F » ea x 
Tyi = i ate ellai— Pd -*/2 <P2, ao) 0009| (5) ie(-3) | |0>. 


This representation shows that 7 is analytic in the variable q = (q: — p)/2, as follows from the 
Jost-Lehmann-Dyson formula, and we may repeat an analysis similar to the one given for forward 
scattering. We take the total incoming momentum p = p; + g: as a time axis. Thus we shall work in 
the center of mass frame and exploit (5-202) to obtain analytic properties in the cosine of the 


(5-202) 
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scattering angle. The coincidence region of advanced and retarded commutator, which could be used 
as well, is limited by the hyperboloids corresponding to the lowest masses M, and M2 of inter- 
mediate states |n> contributing respectively to the matrix elements 


<p2, q2| jy(0)|n><n| jo(0)]0> — <p2, 42| jo(0)|n><n| jy(0) |O> 


(5-203) 
(f+4) <a (f-4) <s 
2 2 
The stability of particles a (momentum q,) and b (momentum p;) imply that 
Mi>m2 M3 >mi (5-204) 
For on-shell scattering 
(2 + a) =q=m (: = a) = pi =m (5-205) 


Seen in the five-dimensional space z, q (recall that we work on the surface P such that z = q”) these 
two linear conditions determine a three-dimensional linear manifold which must clearly have a real 
intersection with P. Moreover, the planes defined by 


2 


2 
7 +2p-q+2= Mi 7 =2p-qt+z= Mi (5-206) 


Figure 5-10 The analyticity domain in the relative three-momentum on the mass shell with real 
intersection Sj. 
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intersect P on two hyperboloids limiting ¢. We denote by @ its convex hull. By virtue of the stability 
conditions the three-dimensional linear manifold representing the mass shell (5-205) intersects P along 
a sphere S, of constant relative three-emomentum q? = pi = qj. The intersection with @ is also a 
sphere S, of radius R larger than |p,| and is a function of the masses (Fig. 5-10). Thus the analyticity 
domain may be entirely described in terms of q. For an arbitrary complex point on S,, 


Req-Imq=0 
(5-207) 
Re q? —Imq’ = @ 
We recall that q’ is to be thought of as the common square length of all three-momenta in the 
center of mass for elastic scattering. To be a point of analyticity the complex point Req +iIlmq 
must be such that the real line Re q + 4 Im q intersects S,. Since Im q is orthogonal to Req this 
implies that Req lies inside S, while the first condition says that Req? >|p,|*. The boundary 
corresponds to (Re q)* = R?. Let us translate these results in terms of the scattering angle 


The length square qj is also fixed to q?. Thus /ex = q°G2, where g2 is a unit fixed vector and 
the variable x varies inside a complex domain bounded by an ellipse, as follows from Eqs. (5-207), 


/ajRex=Rcosa  ,/qiImx=,/R*-—qjsina (5-208) 


This is called the small Lehmann ellipse 


(5-209) 


The foci of this ellipse are the points x = +1 and the semimajor axis is R/./q?. 

More generally, the proof applies to any matrix element of the form <p,,;,|P1,41, in) and gives 
sufficient analyticity properties to allow for a convergent spherical harmonic expansion in relative 
angles, expressing the rotational invariance of the S matrix. 

An elliptic domain is the natural convergence region of a partial wave decomposition, a series 
in Legendre polynomials in the simplest case. To compute the actual value of R is not so easy. It 
turns out that 

RE, (M3 — mi)(M3 ~ mi) oo 
‘ q’ cs (My = M2)"] 
The existence of an elliptic analyticity domain for the elastic amplitude implies that its absorptive 
part is analytic in a larger ellipse (the large Lehmann ellipse). This may be seen below the inelastic 
threshold as follows. Recall from Eq. (5-168) that 


T =16n y (21 + 1)7; (s)P,(cos 6) 
4 (5-211) 


img 


From an estimate of Legendre polynomials for large / it follows that a necessary and sufficient 
condition (generalizing Abel’s criterion for Taylor series) for the expansion (5-211) to converge in an 
ellipse of semimajor axis cos 89 > 1 is that the least upper bound on | 7; |" be 


1 


cos 09 + ./cos? A — 1 
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The absorptive (imaginary) part is defined through a similar series with .7, replaced by Im .7;. From 
(5-211) it converges in an ellipse of semimajor axis Cos 9, given by 


1 1 ) 1 
COs = 4) cos2 0; — 1 5 F A> + ./cos? Sai 2 cos? O) —1 + Ja cos? Wy — 1)? — 1 


that is, cos 8; = cos 26 


These analytic properties in s and ¢ obtained separately have been generalized to analytic properties 
in the joint variables through the work of Mandelstam, Lehmann. Bros. Epstein, and Glaser. Martin 
and his collaborators have furthermore made a profitable use of positivity properties of the absorptive 
part. We shall have the occasion in Chap. 6 to return to this subject in the context of the 
perturbative expansion. 


To close this section let us mention two important results which put constraints 
on the large-energy behavior and are in close relation with analytic properties. 

The first of these is the bound obtained by Froissart on the behavior of total 
cross sections for large s. The intuitive idea is that if strong interactions are of 
finite range the total cross sections should be bounded at infinity. Things are, in 
fact, not as simple, even from the experimental point of view. The reason lies 
in the fact that velocity-dependent forces are also expected when interactions are 
mediated by the exchange of particles with spin. 

The assumptions needed in the derivation are the following: 


x 


1. The enlargement of the analyticity domain is such that for fixed || < to 
dispersion relations in s may be written with at most n subtractions. A poly- 
nomial bound follows in the axiomatic framework and is also a term-by- 
term consequence of lagrangian field theory. In fact, tp may be proved to be 
equal to 4m2 for pion-nucleon scattering, for instance. 

2. From the relation t = —2q7(1 — cos @) and from the fact that the Legendre 
series for the absorptive part may be shown to converge within an ellipse 
of semimajor axis cos 0) = 1 + to/2q? (an improvement over the large Lehmann 
ellipse arising from positivity), it follows that 

ee 

Cee) 

For fixed x larger than one, P)(x) is positive and increases exponentially 

with 1. 


(5-212) 


In essence P;(x) behaves as (x + J/x- 1)' so that the series for the absorptive 
part is effectively cut off at some Imax of the order 


q 


os (5-213) 
0 


oe os 


for s large. From unitarity 
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the series for the amplitude in the physical region where | P:(cos 0)| < 1 is bounded 
by 16m ) im (21 + 1)|.7;| and again from unitarity | Ti| < ./s/2q ~ 1. Thus 


| 7 (s, cos 8)| < 16712... = constant s(In s) (5-214) 


, An accurate estimate of the constant may be obtained in terms of the maximum 
number of subtractions n. The latter can, in fact, be reduced to two using a 
similar bound as (5-214) in the u channel and the Phragmen-Lindel6f theorem 
to show that the bound s(In s)? holds in the complex directions; thus n = 2. The 
final estimates of the Froissart bounds are then 


ez 
| 7 (s, cos 6)| < 161 = (In s)? (5-215) 
ce) 


and from the optical theorem 


l 
rot < (In s)? (5-216) 
0 


Even though the scale of the logarithm is not ascertained, it is striking that the 
trend exhibited by the high-energy data obtained by the latest generation of 
accelerators is in qualitative agreement with the prediction of this bound. It is 
interesting to interpret the angular momentum cutoff (5-213) in terms of an effective 
opacity radius p = Imax/q ~ In Shi: On the grounds of a potential analogy, 
p would be of the order 1/./to (1/2m, for pion-nucleon scattering). It only differs 
from this naive prediction by a logarithmic increase with energy. 

Another property predicted on the basis of analytic properties is the asymptotic 
equality of particle and antiparticle total cross sections from a given target. 

The theoretical observation is originally due to Pomeranchuk. The statement 
of his theorem needs some qualification in view of the fact that total cross sections 
may increase as (In s)? for large s. If o and & denote the two cross sections, then 
under mild assumptions we can show that o/¢ > 1 as s goes to infinity. 

Much more could, and should, be said on the whole subject of the interplay 
of locality and unitarity as dictating the framework of scattering. The reader 
will undoubtedly find a more substantial treatment in the large literature devoted 
to this subject. 


NOTES 


For the Kallen-Lehmann representation see G. Kallen, “Quantum Electro- 
dynamics,” Springer-Verlag, Berlin, 1972. The asymptotic theory is covered in 
the work of H. Lehmann, K. Symanzik, and W. Zimmermann, Nuovo Cimento, 
vol. 1, p. 205, 1955. Additional terms in the current commutation relations were 
discussed by J. Schwinger, Phys. Rev. Lett., vol. 3, p. 296, 1959. 

The perturbative calculations are by now classroom exercises. A wealth of 
information on the Compton effect can be found in R. D. Evans, Handbuch 
der Physik, vol. XXXIV, p. 218, 1958. 
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On the foundations of dispersion relations the reader may consult some of 
the following references according to his or her taste: N. N. Bogoliubov and 
D. V. Shirkov, “Introduction to the Theory of Quantized Fields,” Interscience, 
New York, 1959; Proceedings of the 1960 Les Houches Summer School, 
“Dispersion Relations and Elementary Particles,” edited by C. de Witt and 
R. Omnés, Hermann, Paris, 1960, in particular the lectures of M. L. Goldberger, 
A. S. Wightman, and R. Omnés; 1960 Scottish Universities’ Summer School, 
“Dispersion Relations,” edited by G. R. Screaton, Oliver and Boyd, Edinburgh, 
1961; and Proceedings of the International 1964 School of Physics Enrico Fermi, 
“Dispersion Relations and Their Connection with Causality,” edited by E. P. 
Wigner, Academic Press, New York, 1964, in particular the lectures by M. 
Froissart. 

On pion-nucleon scattering see, for instance, the lectures of J. Hamilton in 
“Strong Interactions and High Energy Physics,” 1963 Scottish Universities’ 
Summer School, edited by R. G. Moorhouse, Oliver and Boyd, Edinburgh, 1964. 

For a summary of the work on analyticity and its physical consequences 
derived from first principles, including a discussion of various high-energy bounds, 
see the lectures of A. Martin in “Physique des Particules,” Les Houches 1971 
Summer School, edited by C. de Witt and C. Itzykson, Gordon and Breach, 
New York, 1973. 

The foundations of S matrix theory are studied in “The S-Matrix,” by 
D. Iagolnitzer, North Holland; Amsterdam, 1978. 


CHAPTER 


SIX 
PERTURBATION THEORY 


We develop the relativistic perturbation theory and present the technique of 
Feynman diagrams. Some attention is paid to the case where the interaction 
contains derivatives. We identify the expansion as a series in Planck’s constant 
and study some elementary topological properties. The parametric representation 
of Feynman integrals is introduced and used to define the euclidean continuation. 
Analytic properties and discontinuity formulas are indicated. 


6-1 INTERACTION REPRESENTATION AND 
FEYNMAN RULES 


The basic problem is the calculation of Green functions, i.e., vacuum expectation 
values of time-ordered products of interacting fields 
G(X1,..., Xn) = <0| T[p(x1) °°: P(%n)] |o> (6-1) 


For brevity’s sake we shall consider a generic scalar field (or collection of fields) 
gy, and shall focus our attention on quantum electrodynamics in the next section. 
It is convenient to collect all the Green functions in a generating functional 


Z(j) a 2 < [ate : = dame }(X3)* : * (Xn) G(X1, an) 
= <0| T exp : fas jonot9 {[0> (6-2) 
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The S matrix may be expressed in the compact form we saw in Eq. (5-38): 


6 : 
S =:exp (|e jet c + m7’) sit as |): Z(j)| j=0 (6-3) 


This relation is itself a generating functional for reduction formulas. The field 
(p(x) satisfies dynamical equations derived from a lagrangian 


L(y) = Lo(9) + Lin(9) | (6-4) 


where Y is a quadratic free lagrangian and Y,,, is an interaction lagrangian. 
For simplicity Y,,, is assumed at first to contain no field derivative. Since we 
are generally unable to find an exact expression of Green functions, we rely 
on a perturbative method, with Y,,, considered as a small perturbation to Zp. 
Generally Y,,, depends on one or several coupling constants and the perturbative 
expansion will be a power series in this or these coupling constants. Even when 
the coupling constant is small, as in electrodynamics where « = e?/4nhc = 1/137, 
we do not expect it to converge. As we shall see later, there are some indications 
that it is, rather, a divergent asymptotic series. In any case, the perturbative 
expansion may be considered as a formal mathematical series, from which much 
information may be extracted. 

When the theory contains several coupling constants, the natural “small” 
parameter may be identified as 4. The corresponding series has also a topological 
characterization. It is a loopwise expansion, according to the increasing number of 
independent loops of the associated diagrams. Therefore, the expansion may 
simultaneously be regarded as a small coupling series, a semiclassical expansion 
about the free-field configuration, or a topologically well-defined procedure. 

As in the previous chapters, we proceed to a heuristic construction and 
neglect at first various difficulties such as the infinite volume limit, the possible 
occurrence of ultraviolet divergences, or the proper definition of the lagrangian. 
These points will be reconsidered in the following. 


6-1-1 Self-Interacting Scalar Field 


As explained above, we first confine ourselves to a single scalar field ~. We 
have shown in Chap. 4 that there exists formally a unitary operator U(t) which 
transforms the free field ¢;, into the interacting one ¢(t): 


o(x) = U-*(t)@in(x)U(t) ~~ with t = x° (6-5) 

where Ui 1 exp E [- dt’ Hint «| (6-6) 
Since the coupling terms contain no field derivative, 

Hint(t) = — lo. dx Lin [@in(x)] (6-7) 


More generally, let U(t2, t1) stand for 
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t2 
U(t2, ty) = T exp |: | dt’ [ex Fiee(X, a] 
t 


1 


U(t,t)=1 (6-8) 


‘where Yin, is expressed as before in terms of ¢;,. This operator enjoys the 
following properties: 


U(t, — co) = U(t) U(ty, t2) U(t2, ts) = U(ti, ts) 


We shall now derive the fundamental relation 
<0| Texp [ [ats er) \0> 


<0| T exp : far {Fine LQin(x)] + io | |o> 


= (6-9) 
<0| T exp ‘ xray foaco]} |o> 
or, equivalently, after identification of the term of degree n in j: 
<O| T@in(1)** * Pin(Xn) EXP \ [ats Pint foute} |o> 
Gee, EX) = (6-10) 


<0| T exp f ox Coats |0» 


In the right-hand side of Eqs. (6-9) and (6-10), the T symbol acts on the whole 
expression. This means that after an expansion of the exponential in powers of 
Lin all the fields gi, have to be time ordered. The vacuum state |O> is to be 
understood as the “in” vacuum state |0 in >. 

We assume that the interaction is adiabatically switched off in the remote 
past; therefore lim,.-., U(t) = J. Let us consider a set of time-ordered points 
X1,...,Xp, that is, satisfying x? > x? > --- > x°. We then have that 


<0] Te(x1)-++ (Xn) |0> = <0] p(x1)--- P(%n) | 
= €0| U~*(t1)@in(x1) U(t1, t2) @in(X2)** 
x U(tn-15 tn) Pin(Xn) U (tn) |O> 
Introducing a very large time t such that t »>t,;, —t «t,, we may write 
U(t,) = U(tr, —t)U(—t) 
and U~*(t;) = U- (UC, ty) 
Thus 
£0] Te(x1):+* el») [> 
= <0|U~*(t) T@in(X1)*** Pin(Xn) eXP E { at Hal | U(—1)|0> (6-11) 
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This formula remains valid for an arbitrary order of times x?,...,x, within the 
interval (—t, t). Letting t go to infinity, we get 


lim U(—t)|0> = |0> (6-12) 


while in the same limit <0|U °‘(t) must be equal to <0] up to a phase since the 
vacuum is assumed to be stable. Therefore 


lim <0| U~*(t) = lim <0| U~ *(t)|0> <0 


eee ae [oy © Or 
Inserting (6-12) and (6-13) into (6-11) leads to the desired Eq. (6-10). This derivation 
is nothing but heuristic, so we may consider Eqs. (6-9) and (6-10) as a mere 
definition. 

It is gratifying —and this was not a trivial point in the early days of relativistic 
perturbation theory —that Eqs. (6-9) and (6-10) lead to explicitly covariant expres- 
sions. From Wick’s theorem, the evaluation of T products reduces to combinations 
of normal products and covariant propagators. The bookkeeping of these com- 
binations may be translated into a diagrammatic language, which we shall now 
present. 

We expand the exponential in the numerator in Eq. (6-10). From now on, 
the index “in” will be omitted: 


G(x1,..., Xn) <O| T exp : [as %u() |0> 


= ie "i <0| T@(x1)*** O(Xp) fer, oe -d*y, Pini)" Geis o> (6-14) 


To be explicit, we shall consider the so-called a“ theory: 
a [ p(x) ] =a ere p*(x): (6-14a) 


but all the following reasoning may be easily generalized to an arbitrary poly- 
nomial nonderivative interaction. The normalization 1/4! has been introduced 
for later convenience, as it will eliminate combinatorial factors. The minus sign 
in front of the coupling constant eo be the one used in a classical field theory 
of total lagrangian (0,:~)"/2 ~ m* 7/2 — Ap*/4! to insure the stability of the 
solution @ = 0 of the equation me motion. After substitution in Eq. (6-14), we get 


GGi,-- 2, Xn) CO texp | - = [as -o%¢e): | 
al 


=> = <O| Tp(x1)-*- ote) | aty dy ONS a prone ee lo 


(6-15) 
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To evaluate the vacuum expectation value of the time-ordered product of free 
fields we use Wick’s theorem [ Eq. (4-65) ] with the elementary contraction 


wren ' d*k i : 
RX; — @) VE 5 0 = aS ee — ik +(x—y) x 
(x) 9(y) = <0| Te(x) p(y) |0> (& ae (6-16) 
We recall that two fields appearing under the same symbol : : must not be 


contracted. A given set of contractions is conveniently represented by a diagram, 
the contribution of which results from rules, which we first describe in configuration 
space. Since only Green functions with an even number of external fields are 


nonvanishing in the theory (6-14a), hereafter we replace n by 2n. To the pth- 
order term 


ea - prt . 

(= 1)" 0| Fels)“ olay) “PY... 3 Wadi gy (6-17) 
we associate a sum of contributions, each of which corresponds to a Feynman 
diagram. On this diagram, x4,...,: X2, are the endpoints of the external lines and 
V1,.--, Vp are the vertices. Four lines originate from each vertex, and all the lines 
join two distinct points: x, and x,, or x, and y,, or y, and y,(a # B). A line joining 


z; to 2; (2 =X, Or y,) corresponds to the propagator (zz). To each vertex, 
we associate the weight —id. Finally, there is a symmetry factor resulting from 
the number of possible contractions leading to the same diagram. If we contract 
all the four fields originating from a monomial : ¢*(y,):/4! with four fields of 
different arguments, we get a combinatorial factor 4! which will compensate the 
factor 1/4!. On the other hand, if two lines originating from y, end at the same 
point y,, the factors 1/4! of vertices y, and y, are only partially suppressed. 
In the case depicted in Fig. 6-1a, the factor is 1/2!. This results from 1/4! x 1/4! 
(initial vertex factors) x 4 x 3 (number of possible contractions z; — y1, Z2 — y1) 
x 4 x 3 (contractions z3 — yz, z4 — y2) x 2 (contractions between y, and yp). 
Similarly, in the case of Fig. 6-1b, the factor is 1/3!. More generally, the resulting 
symmetry factor is 1/S, where S is the order of the symmetry group of the 
diagram, i.e., of the permutation group of the lines, when the vertices are fixed. 
We summarize the Feynman rules for the T product (6-17): 


1. Draw all distinct diagrams with 2n external points x,,...,X2, and p vertices 
V1,-++> Vp, and sum all the contributions according to the following rules. 


Vi aD 


(a) (b) 


Figure 6-1 Examples of diagrams with a symmetry factor: in case (a), S = 2!; in case (Oys = 3 
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(a) (b) (c) 


Figure 6-2 Low-order contributions to the two-point function in g* theory. 


2. To each vertex attach a factor —i/. 
3. To each line between z; and z; attach a factor ¢(z,)p(z;) given by (6-16). 
4. Each diagram has to be divided by a symmetry factor S. 


As an example, let us examine the two-point function (— iA)? <0] Te(x1)e(x2) 
X1)@ 


:97(y1):/4!--+: p*(yp) 2/4! [0D to low orders. For p = 0, we get 9 (x2), while 
there is no p = 1 contribution, since we forbid contractions ~(y)@(y). For p= 2, 


there are three contributions, depicted in Fig. 6-2 


—iy — care 4 
(a) ( 7; P(X1) (x2) Le(v1) P(y2)] 
(— id)? aie —S ; oc 
(b) 31 e(xi)o(y) Lely 1)P(y2)] P(y2)P(x2) 
(c) The same with y; > y2 


Let us emphasize the importance and the meaning of the word “distinct” in 
rule 1 above. In configuration space, diagrams are considered as distinct when 
they are topologically different, all the points x; and y; being fixed. For instance, 
diagrams (b) and (c) of Fig. 6-2 are distinct, whereas diagrams (a) and (b) of 
Fig. 6-3 are not; in the latter case only one of them has to be taken into account. 

Let us divide the diagrams into two classes. The first class contains no 
vacuum-vacuum subdiagram, i.e., subdiagrams (like the one of Fig. 6-2a) which 
are not connected to the external points; the second class includes such sub- 
diagrams. To each diagram of the first class, we may associate a set of second- 
class diagrams, by addition of vacuum-vacuum subdiagrams. Denoting by the 


xy ¥3 x3 xy a x3 
X4 Va X4 x4 V3 X4 
(2) (b) 


Figure 6-3 Example of indistinguishable diagrams; only one of them has to be taken into account. 
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superscript (1) the first-class contribution to the Green function, we may write 


Giza, X25) \0| Lexp { [ax ia too |o> 


oo jP Pp 
i a p! {a vir d*yp » (?) <0 TP(X1)-** O(X2n) Fint(1)* + Line (Je) [OO 
x <0| TLin(Ve+1)°°* Aah Vs) |0> (6-18) 


since the (p — k)th-order vacuum-vacuum subdiagram corresponds to the vacuum 
expectation value of the time-ordered product of p — k interaction lagrangians. 
Moreover, the combinatorial factor 


p\_ p! 
k} k\(p—&)! 


results from the fact that, after integration over y,,..., yp, all possible permutations 
of the y give rise to equal contributions. It is easy to see that the vacuum 
amplitudes ¢O| T exp [i J d*x Lin.(x)]|0> factorizes on both sides of Eq. (6-18) and 


G(X), a ae X =n) 
20 jP 
= 5 A fay + d* yy 0| Te(x1)*+* @(xXan)Lins(y1)°** Lint(¥p)|O> (6-19) 
so 
Only diagrams without vacuum subdiagrams contribute to the right-hand side 
of Eq. (6-19). 
It may be simpler to give the Feynman rules in momentum space, ie., to 
consider the Fourier transform G: 
d*p, — d* Pan =a ~ 
Ge), .. ee) = ene j :* x; Pee Ri 
1, . . ean) | Qn? Qme oP iD Pi s) G(P1,---s Pan) 


(6-20) 


2n 
“G(D4,-- ->Pan)= [ate Gd" eae EXD (- y nsx) CCS ee 
j= 


With our conventions, all the p are incoming momenta. Translation invariance 
entails momentum conservation ) {" p; = 0, which implies 


G(p1,--+» Pon) = (22)*54(py + °** + Pan) G(p1,--+> Pan) (6-21) 


For simplicity we use the same symbol in x or p space. We have to integrate 
over all space-time points x and y to get G from (6-19) and (6-20). Using the 
Fourier representation (6-16) of the elementary contraction, the integration over 
x; produces a contribution i(p? — m? + ie)~ 1 for each external line. We are left 
with the integrations over the y; and the k,, which are the momenta running 
through the internal lines. Let us concentrate on the contribution attached to a 
given configuration of lines and vertices for fixed internal momenta and given 
external configuration arguments x,,...,Xx2,. Any permutation of the labels 
Yi,---5 Vp Of the internal vertices yields the same contribution. However, not all 
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of the p! permutations of the y need generate topologically distinct diagrams 
in the original configuration space. If this were the case we would get a factor 
p! compensating the factor (p!)”’ appearing in Eq. (6-19). It may happen for 
some diagrams that two (or more) vertices play identical roles (this is the case 
in the example of Fig. 6-3a where y3 and y4 are such vertices). If the p vertices 
are divided into groups of v1, v2,...,Vs vertices playing the same role (v, + °° 
+v,=p), the integration over the y variables yields a degeneracy factor 
p!/v,!---v,!. This compensates only partially the (p!)* factor of Eq. (6-19) and 
leaves a vertex symmetry factor 1/v,!---vs!, which multiplies the line symmetry 
factor discussed previously. For instance, the diagram of Fig. 6-3a has, in 
momentum space, a global symmetry factor 4 4. 
Finally, every y integral is of the form 


[ay eM” t= (Deel O 145) 


where q; denotes the sum of all external or internal momenta flowing into the 
vertex y;, Momentum is conserved at each vertex, and is thus globally conserved. 

Feynman rules in momentum space may now be written, for the Green 
function G(pj,..., P2n), a8 follows: 


1. Draw all topologically distinct diagrams with 2n external lines of incoming 
momenta pj,...,P2n and without vacuum subdiagrams. For each diagram, 
denote by k,,...,k; the momenta of internal lines. In a scalar theory without 
derivative coupling, the choice ofan orientation of the internal lines is irrelevant. 

2. To the jth external line, assign the factor i/(p7 — m? + ie). 

To the /th internal line, assign [d*k,/(27)*][i/(k7 — m? + ie)]. 

4. To each vertex, assign (—iA)(2m)*6*(q;) where q; is the sum of all incoming 
momenta at vertex j. 

5. Integrate over the k variables the product of all these contributions and 
divide by the symmetry factor of internal lines and vertices of the diagram. 

6. Sum the contributions of all topologically distinct diagrams. 


ee 


Generally, if V is the number of vertices (called p previously) and J the number 
of internal lines, there are V — 1 conservation rules among the k after separation 
of the global conservation (6-21), and thus at most J — V + | nontrivial integra- 
tions. 

The Feynman rules must be slightly modified when some external lines are 
connected to no vertex at all. For instance, the single propagator of Fig. 6-4 
gives in configuration space the contribution 


ap Saas e” ip: (x, — x,) 
(2n)* p* —m’ + ie 


and in momentum space 
i 
+5 (2n)* 5* 
es (Px + pr) 
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e—_—______ _____@ Se 
—_ = 
XE x) DP; DP; 


(a) (b) 


Figure 6-4 Diagrams with a short-circuited propagator, in configuration space (a) or in momentum 
space (b). 


Let us illustrate these rules to lowest order for the two- and four-point 
Green functions. Factoring out the global momentum conservation 6 function, 
the contributions of the diagrams of Fig. 6-5 to the two-point function G2(p, — p) 
read 


1 
p? — m? + ie 
3 


(b) (—iA? ( i i d*k, d*k, 
3! \ p* — m? + ie (2n)*(2n)* 
i 


* (ke — m? + ie\(k3 — m? + ie)[(p — ky — ka)? — m? + ie] 


(a) 


Similarly, the contributions from the diagrams of Fig. 6-6 to G4(pi, P2, ps, Ds) 
(with pi + p2 + ps + pa = O)are 


5*(p, + po) 6*(p1 + ps) 

“2 4 

ee lz Sm? + ic)(p} — m? + ie) * (ph — m? + ie)(p} — m? + i) 
ae | 
(pi — m? + ie)(p3 — m* + ie) 


— = 
p 


aE p ~P 


(a) (b) 


Figure 6-5 Low-order contributions to the two-point function in momentum space. 
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Pa P2 Py P3 Py ee Py P3 
—s —_— — —— —_ —_ 
+ + 
— — — — — — P2 P4 
P3 P4 P2 P4 P2 P3 
(a) (b) 

Py k P3 Py P2 Py P3 
P2 Py +P. —-k P4 P3 P4 P4 P2 
(c) (d) (e) 

Figure 6-6 Low-order contributions to the four-point function. 
(b) i*(—id) 

(pj — m? + ie)(p3 — m? + ie)(p3 — m* + ie)(p4 — m? + ie) 
(c) rl (ae 

2 4 pj —m* + ie 


1 
(2n)* (k? — m? + ie)[(p1 + p2 — k)? — m’ + ie] 
(d) or (e) Same as (c), with p2 <> p3 or p2 > p4 


In the computation of the corresponding S-matrix elements the external propa- 
gators i(p? — m* + ie)”' are just compensated by the i((J,, + m*) operator of 
the reduction formulas (5-28). In the latter, the last integration over x; identifies 
the incoming momentum of the Green function as defined in (6-20) with the 
physical on-shell momentum of the S-matrix element. Therefore, to get the con- 
tribution of a given Feynman diagram to the S matrix, we omit the external 
propagators and put the external momenta on the mass shell. As we shall see 
later, this rule must be supplemented by mass and wave-function renormalizations. 

When a diagram is disconnected, the Green function G still contains 6 func- 
tions expressing the conservation of a partial sum of external momenta. In 
Chap. 5, we have defined the connected Green functions G, by the recursive 
expression 


Goer. %)= F TT Gber) (6-22) 


where / = {1,...,n}, x; = {X1,...,Xn}, together with the initial condition G(x,, x2) 
= G.(x1, X2) (for the y* theory). The diagrams without the vacuum subdiagram 
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Figure 6-7 A “tadpole” diagram. 


involved in the perturbative expansion of G(x,, x2) are (topologically) connected. 
More generally, it will be shown by induction that only connected diagrams 
contribute to G.(x1,...,X2,). Let us assume that this is true up to the (2n — 2)- 
point function, and let us rewrite (6-22) as 


Gite aa) a Salt, - aan) a y I] G.(x1,) (6-23) 
UI,=I « 
Ja|>1 
After substitution of the diagrammatic expansion for G(x1,...,;Xa,_) and for each 
G, in the sum of the right-hand side, each contribution from a disconnected 
diagram to G(x;,...,X2,) may be identified with a term of the sum, and vice 


versa. This shows that the algebraic definition of connectivity [Eq. (6-22)] may be 
identified with the topological one. 

Let us return to the role of the normal ordering of the interaction lagrangian 
in (6-14a). Had we kept the ordinary product Vint = —A/4! p*(x), new diagrams 
would have emerged, namely, those with contractions of fields pertaining to the 
same vertex. For instance, the “tadpole” diagram of Fig. 6-7 would appear in 
the two-point function. In certain circumstances, it may be more convenient to 
keep the ordinary product and to deal with tadpole diagrams. This is, for instance, 
what happens when we want to perform a local inhomogeneous transformation 
on the field (a translation, say), while keeping some symmetry properties. For 
the y* theory, passing from one ordering to the other simply amounts to a 
change of the mass term. The latter change is actually infinite, but may be 
included in the mass renormalization that we shall study soon. 

After this derivation of the Feynman rules for a self-interacting scalar field, 
we shall now do more rapidly—the same job for more elaborate and physically 
more interesting theories, namely, for spinor and scalar quantum electrodynamics. 
Other cases involving fields with an internal symmetry will be also encountered 
in the following chapters. At any rate, the general method for deriving Feynman 
rules should be clear. In any doubtful situation, in particular when symmetry 
factors are dubious, it may be safer to return to the starting point, that is, 
Eq. (6-10) and Wick’s theorem. 


6-1-2 Feynman Rules for Spinor Electrodynamics 


The total lagrangian describing the interacting system of photons, electrons, and 


positrons reads 
Pot Lo + Lin (6-24) 
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where 


1 pe? A 
Y cee ree od ee eA le 
a ra 5 (0 A) 


£5 = ST — mip (6-25) 


ee int aa Ss ewy'WA, 

The various terms have already been encountered in Chaps. 3 and 4. In the first one, 
we introduce a massive photon of mass much smaller than the electron mass m, 
and an indefinite metric is needed. The interaction term results from the minimal 
coupling prescription id, — id, — eA, in the electron lagrangian. 

We shall use distinct notations for the two types of propagators involved 
in the contractions. The electron-positron propagator will be denoted by a solid 
line, oriented in the direction of the charge (e) propagation: 


as = 60 TWalx)V p(y) |0> = Val) Vey) 


xX, Of ys 
d*k ik + (x~—y) i 
= — tk + (x 6-26 
| ie K—m+ ié/ag ( ) 


As is well known, this propagator is not a symmetric function of x and y but 
satisfies Eq. (3-176). On the other hand, the photon propagator will be marked by 
a wavy line: 


fnapeer) 

oe, = 0] TAG) As(9)]0> = Alo) Ad(y) 
ak ie -ts- ny (oe — Kok ol kyk p/p? 

= \& ‘ (—?) k?— p+ ie | K2—M? +e (6-27) 


To avoid confusion with the electron mass m, we have set M? = p?/A. Notice 
that A and M? are not expected to appear in the physical results. 


Because of charge conservation, Green functions have an equal number 
of w and w fields: 


Giese 5 Xn Xe eee eons yp 
= 0 TYW(X,) 7 W(Xn)W(Xn+ ue W(Xan)Av,(y1) i A, (Yp) |o> (6-28) 


For brevity’s sake, we have omitted spinor and vector indices in the arguments 
of G. As in Eq. (6-14), we derive an expression for G in terms of in-fields: 


Gey ee Vi vay) 
<0| TWin(X1) oo Win(X2n)Ai(y1) ae AY (yp) exp i [ats L int(Z) |0> 


(6-29) 
<0| T exp i | d*z Liu(z)|0> 
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where Pint(Z) = Lint[Win(Z), Win(Z), Ain(Z)] is supposed to be normal ordered. 
The “in” indices will be omitted in the sequel. Expanding (6-29) in powers of e 
and using Wick’s theorem leads to Feynman diagrams built out of the propagators 
(6-26) and (6-27) and a vertex 


© = —ie(Yy)pe (6-30) 


Co B 


This vertex possesses one vector and two spinor indices which are contracted 
with the corresponding indices of the photons and fermion propagators. As in 
the scalar case, the role of the denominator in Eq. (6-27) is to eliminate the 
vacuum-vacuum subdiagrams. 

Let us collect the signs introduced by the Wick rules for fermions. From 
charge conservation, there are two kinds of fermion lines in the diagram, closed 
loops or open lines ending at a point x;(1 <i<n), and coming from a point 
x(n + 1 < k; < 2n). A closed loop is composed of a sequence of propagators 

moet (Graal (Tal 


W(z 1) (za) (za) (zi) - + (Zaz) (6-31) 


between fields appearing in interaction lagrangians. Since the latter commute 
under the T symbol, we may reorder the T product as 


<0] Te (Wie A(z) W(21)] [Wed ACW) LY (Za) Az W(2q)] + * |0> 


without introducing any sign. The product of contractions (6-31) is obtained 
after the permutation of y(z,) with an odd number of fields. Therefore, a minus 
sign is to be associated to each fermion loop. 

On the other hand, the open lines define a permutation of the points 
Xj X1Xp,X2Xk, °°’ XnX~, Where we recall that x,, is the origin of the line ending at 
xp. There is then an extra sign, which is the signature of the permutation 


(1, 2,..., 27) — (1, k1, 2, k,...5 %, Kn) 


Diagrams which differ only by the orientation of a fermion loop contribute both 
only when they are topologically distinct. For instance, the two diagrams of 
Fig. 6-8 are clearly identical; only one of them contributes to G(y1, y2). On the 


(a) (0) 


Figure 6-8 Example of identical diagrams in spinor electrodynamics; only one of them has to be 
retained. 
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yy y2 yy y2 
zy 29 Z1 = 
Z4 Z3 Z4 23 
V4 y3 V4 Ys 
(a) (b) 
yy Ve Vi v3 Vy V4 Vy Yo Vy v3 va 
y2 Via Vas ¥2z 2S O3 v4 Va Yo Va 
(c) 


Figure 6-9 The four-photon amplitude to lowest order. The diagrams (a) and (b) are not identical in 
configuration space. After summation over the z variables, we are left with the six distinct diagrams 
depicted in (c). 


= 


contrary, the first two diagrams of Fig. 6-9 contribute both to the four-photon 
function (photon-photon scattering). They correspond to the two distinct sets of 
contractions: 


— SS le aan 
W(z1) A(z1)W(z1)W(Z2) A(z2)W (22) (23) A(z3)W (23) (za) A(Za) (Za) 


(a) 
and 
SE SS ee 
(b) W(z1)A(z1)W (21) W (22) A(z2) (22) (23) A(Z3)W (23) W(Z4) A(za) (Za) 


where A(z,) is contracted with A(y,), etc. In configuration space, the four-point 
function G(y1,...,y4) receives contributions from 4! x 3! distinct diagrams, 
obtained from those of Fig. 6-9a through permutations of z,, 22, 23, Z4 and yo, y3, ya; 
the diagram of Fig. 6-9b, viz, is derived from Fig. 6-9a by permuting y2< ya, 
Z2++z4. When carrying out the z integrations, the 4! permutations over the z 
give a factor that compensates the 1/4! coming from the expansion of 
expif dz Y,,,(z). We are left with six distinct diagrams (and no factor) (see 
Fig. 6-9c). 

The representation in momentum space is obtained by Fourier transformation. 
For a connected function, we set 


2 


V3 
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can. ehalaiaaana 
2n ae d* : 2n Pp 
j : : 
rem @ Be ; Pe ri 


x _—. Gp) (6-32) 


where all momenta are > incoming. Let us now summarize the Feynman rules 
for the computation of G,. Draw all possible topologically distinct diagrams and 
associate the factors enumerated in Table 6-1. Then contract all spinor indices 
along the fermion lines (and thus compute a trace for each closed loop) and all 
vector indices along the photon lines. Finally, carry out all the integrations over 
internal momenta. 


Table 6-1 Feynman rules for spinor electrodynamics 


1. External lines 


Incoming fermion 


a B i 
Outgoing fermion (ane, 
<+—_p 
d p (Gov — %p9/K qo4v/ ) 
Incident photon q <3 i( Sense @ - M2 in 


2. Vertices 
: Lu 
—ie(y,)ga(2n)*5*(S, incoming momenta) 
a B 
3. Propagators 
P 8 d*p ( i ) 
. (2x)* \p—m + ié/ pa 
k d*k 9vp — Kuk p/p? kyk p/p 
-————» ieee, ee 
“ ‘ SE (9 (ene 5 a 


4. A minus sign for every closed fermion loop and a global sign, depending on the configuration of 
external lines, computed as indicated above 


276 QUANTUM FIELD THEORY 


Figure 6-10 A diagram of spinor electrody- Figure 6-11 Two diagrams with opposite 
namics without any symmetry factor. orientations of the spinor loop. 


For a diagram of order r (diagram with r vertices), the integrations over the 
r variables z and their permutations cancel the factor 1/r! coming from the 
expansion of exp i J d+z Lin1(z). Therefore, with this set of rules, when only topo- 
logically distinct diagrams are taken into account, no symmetry factor occurs in 
fermion electrodynamics. The reader is invited to compare the diagram of Figs. 
6-84 or 6-10 with the analogous scalar diagrams of Figs. 6-6c or 6-3a. The former 
have no symmetry factor when a single orientation is selected for each fermion 
loop; the latter have weights respectively of 1/2! and (1/2!)’. 


Furry’s theorem All the diagrams containing a fermion loop with an odd number of vertices may 
be omitted in the computation of a Green function. Indeed, the two orientations of the loop (Fig. 6-11) 
lead to contributions of opposite sign. To show that, we write the contribution corresponding to the 
first orientation as ~ 


Gy =tr [u,Sr(Z1, Z5)¥u,5 F(Zs, Zs- ee a Vu, SF(Z2; z1)] 
and we appeal to the existence of a matrix C satisfying [compare with (3-176)] 


CS;(x, y)C~* = SF(y, x) 


Cy,C>* = —y,7 oe 
in order to insert CC~ ' between propagators and rewrite 
Gy = (—1) tr [yt SF(Ze 21) 9 SF(Zs~ 1, 25)77,_, °° V7, SE(z1, Z2)] 
= (—1F tr [py Sr(Z1, 22) Yu." * Yu, Se(Zs, 21) (6-34) 


Up to the sign (—1)*, this is exactly the contribution of the second orientation. For odd s, the two 
contributions cancel, which proves the statement. 


6-1-3 Electron-Electron and Electron-Positron Scattering 


As an illustration of the diagrammatic machinery, let us compute the cross section 
for electron-electron scattering to lowest order. We start from the reduction 
formula derived in the previous chapter: 


-\4 
—}; Sfienll otal eed = 
Sy = (Hr) [ax d* x2 d*x), dtx'g ellPi 1 +P2*x2—pi+x1— pa x2) 
Z 


x Up's, €1)(i9x, — m)a(p2, €2)(iPx, — m) <O| TW(x)W(xd)U(x1) (22) (O° 
x (—i Ge, — m)u(Pr, &:)(—i 9, — m)u(po, &2) (6-35) 
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Pir P1>€ 


oD 


: Figure 6-12 General form of the diagrams contributing to electron- 
, : 
P2,€2 P2,€2 electron scattering. 


The notations are summarized on Fig. 6-12; notice that here p{, p> are the out- 
going electron momenta. To lowest order, the two diagrams of Fig. 6-13 contribute 
to the Green function, and Z, = 1. The Dirac operators and the integrations in 
(6-35) compensate the factors attached to the external lines and put the external 
momenta on the mass shell. Use of momentum conservation at every vertex 
implies in this case that no integration is left. The two diagrams have a relative 
minus sign and 


Spi = (2n)*5*(p, + p2 — pi — p2)(—ie)? 


yp — KDR? kL? 
q | =a, 27 lps.eaN—9 (2 : ae o | te 


roam fo pee) U(P2, 2)y?u( Po, £2) 


Ivp — kk) / 2 ke) 2) / 1? 


+ ii(p, &2)y"u(pi, €:)(— a( KOE y2 set | u(p', &41)y?u(p2, “3 | 


(6-36) 
where b= pe— Pi pa P2 
Ki?) = py — P2 = Pi — P2 
Therefore, 
k\Y a(p', &s)y’u(p1, €1) = Up, €1)(P1 — Pi)u(Pr, &1) = 9 
and ki? a(p2, &2)y"u(p1, €1) = W(p2, €2)(P1 — P2)u(p1, &1) = 0 


The terms in k,k, in the propagators give no contribution, and in the computation 
of the elastic cross section we may take the » = 0 limit without encountering 
any infrared divergence: 


So; = —ie?(2n)*5*(pi + p2 — Pr — DT 


Pt Py P? Py 
KD = py — Pi Piers 
Ps P2 Pj P2 


Figure 6-13 Lowest-order contributions to electron-electron scattering. 
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pi , p’) 


P, (E,p) eee / p,(E,— p) 


Figure 6-14 Kinematics of electron-electron 
p,@,—p’') scattering in the center of mass frame. 


G7 — UP, &1)y"ulP1, €1)m(P2, €2))vu(D2, €2) 
(p1 — pi)? 
_ U(p2, &2)y"u(D1, €1)U(D', €1)vU4(D2, €2) 
(pi — p2) 
The antisymmetry of the initial or final states is now manifest. There is no factor 
due to the identity of the particles, but of course the total cross section will 
be obtained by integrating over only half of the final phase space. 

Let us compute the differential cross section for unpolarized initial beams, 
when the final polarizations are not observed. The kinematics of the reaction in 
the center of mass frame is represented in Fig. 6-14, where @ is the scattering 
angle in this frame, the energy E is conserved, and we denote |p| = |p’| = p= 


./E* — m?. Using the general formula (5-13), with due care paid to the fermion 
normalization, we get 


[(p1* p2)? — m*]"? J (2n)° EB; (2n)*° E 


with 


(6-37) 


(27)*6*(p, + p2 — pi — pa)e* \Fi\ 
(6-38) 


In the square modulus of the amplitude, an average and a sum over initial and 
final polarizations are understood. After integration of the energy momentum 
conservation 6 function 


do m?e* 


oa 4E?(2n) |7 |? (6-39) 


fi tf fe 
=F f(r ee BE tm yp tm : 22 
4 2m 2m 2m 2m / |(pi — Pi) ] 
ptm potm,, pe +m ee 1 
2m °° Im - 2m 2m / (pi — p1)*(p2 — pi)” 


+ (pi =r} (6-40) 


= el] Ns 
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The traces are readily evaluated 
tr [yi + m)y(B1 + m)] = 4(P1vP 1p — voP1 "Pt + PipPiv + mg,,) 
tr [y(pr aie m)) (Pt te m)| tr [y’(p2 =r m)y?(p> =F m)| 


d = 32[(pi*p2)? + (pi * p)? + 2m? (pi + p> — pr‘ pa) 
an 


Yv(B1 + m)yo(p2 + m)y’ = —2p2y,~1 + 4m(p2, + Pip) — 2m’y, 
tr [yo(P1 + m)y (po + m)y’(p2 + m)y?(p, + m)] = —32[(p1- p2)? — 2m? p, -pa| 


Therefore, 


Ea 1 (p1°P2) + (pi p2)? + 2m? (pi * po — pi p2) 
2m* [(p' aa p1)*|? 
_ or *p2)? + (pr * pi)? + 2m? (pi - pi, — pi* po) 
[(p2 — pi)"]? 


50 


z 2 9 2 “n> 
(p1°p2)? — 2m? py | (6-41) 


(Pi — pi)*(p2 — pi)? 
We may express all invariants in terms of the energy E and scattering angle 0: 
P1*P2 = 2E* — m? 
Pi°P1 = E*(1 — cos 9) + m? cos 0 
P1°p2 = E*(1 + cos 6) — m? cos 0 
We finally obtain the M@ller formula (1932) 
2) 2 2)\2 var ae 
In the ciarcwevintte limit, where m/E > 0, 


= eer 
E* \an"O @em-6 4) 4&7 \sin* 6/2  cos* 6/2 
(6-43) 


ur 


and in the nonrelativistic limit, E? ~ m?, v? = (E? — m?)/E’, 
aes). | eee ‘in SS 
dQ|,, \m/ 4v*\sint 6 sin? 0 
o\- mel gs 1 1 
= {— oo 6-44 
(=) 1604 (a ieee ant aston) = 
a result obtained first by Mott (1930). We let the reader reexpress these cross 
sections in the laboratory frame. It is also instructive to compare (6-44) with the 


nonrelativistic limit of the Rutherford formula for Coulomb scattering, where we 
set Z = 1 with a reduced mass equal to m/2. 
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Electron Electron 
U 
Py P\ 
q1 q) Figure 6-15 General form of the diagrams contributing to 
Positron Positron _ electron-positron scattering. 


Let us now consider electron-positron scattering. The kinematics and lowest- 
order diagrams are depicted in Figs. 6-15 and 6-16. Polarization indices are 
omitted and in Fig. 6-16 four-momenta are oriented according to the charge 
flow. The scattering amplitude may then be obtained from (6-37) by substituting 

u(p1) > u(p1) Pi P1 
u(p;) > u(pi) = pi > PA 
u(p2) > v(q1) pa —ai 
u(p2)> 0(41) pa >—- Hh 


(6-45) 


and by changing the sign of the amplitude. The center of mass cross section is 
readily computed: 


do m’* e* 


a ae = || a2 
io ~ aon? !7 | 


with 
\7/? = a 1 qi)? + (pit qi)? — 2m? (pi-qi — Pt *q\) 
2m [@1 — pi)? ]? 
a: (p1°q1)? + (pi pi)* + 2m? (pi > pi + pi‘ qi) 
[(pi + qy|’ 
(p1°qi)? + 2m*p, ai 
+2 ————_._—_——_————— 6-46 
(pi — pi)(pi + 41)” Ca 
Py Py P} Py 
Py =) 
=4; ~qy 


=a) mi 


Figure 6-16 Lowest-order contributions to electron-positron scattering. 
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It is then straightforward to write the expression of the cross section: 
ia 3 _ 8E* — m* (2E? — m?)? 
7) 2E@\4 Geeta) (femcos0) Q(E? an". — cos 6° 
i. 2E*(— 1+ 2cos 6 + cos? 6) + 4E?m?(1 — cos 6)(2 + cos 8) + 2m* cos? ‘ 


16E* 
(6-47) 
The ultrarelativistic and nonrelativistic limits are respectively 
date a? [1+ cos*6/2 1 cos* 6/2 
| a ee eee 2) faa) et 
7. =| a sin oe | ie 
de (a 1 
dQ,  \m/) 16v* sin* 6/2 Cn”) 


Notice that the annihilation diagram does not contribute to the latter case. These 
expressions are due to Bhabha (1936). 


The results of Eqs. (6-42) and (6-47) may be compared with experimental data. At low energies we 
show in Fig. 6-17 some data of Ashkin, Page, and Woodward (1954) for electron-electron scattering 
at 90 degrees. Mdller’s formula (6-42) gives a good agreement, in contrast with the corresponding 
expression [to be derived below, Eq. (6-65)] which would apply to spinless particles. Results for 


> : 
a B 
o = Oe 
S g 
3 > 
a 3 
& & 1.0 
0 
0.5 1.0 1.5 0.6 0.7 08 09 1.0 
Ey,» (MeV) Ey ap (MeV) 
(a) (b) 


Figure 6-17 Experimental data for electron-electron and electron-positron scattering at 6 = 90° as a 
function of the incident electron energy in the laboratory frame. (a) Electron-electron scattering. 
The solid line represents the Mller formula, the broken one the Mller formula when the spin 
terms are omitted. (b) Electron-positron scattering. The solid line is the Bhabha formula, the broken 
one the prediction when annihilation terms are deleted. (From A. Ashkin, L. A. Page, and W. M. 
Woodward, Phys. Rev., vol. 94, p. 357, 1974.) 
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Figure 6-18 The cross section for Bhabha scattering at high energy, for scattering angle 45° < 6 < 135° 
as a function of total energy momentum square s. The solid line is calculated from quantum 
electrodynamics with first-order radiative corrections. (From K. Strauch in “Proceedings of the Sixth 
Symposium on Electron and Photon Interactions at High Energies,” edited by H. Rollnik and W. Pfeil, 
North-Holland, Amsterdam, 1974.) 


electron-positron scattering are also fitted quite well by Bhabha’s cross section. Omission of the 
annihilation term would spoil this agreement. The energy of the incident particle in the laboratory 
frame plotted on the abscissa is chosen in the intermediate range where neither the nonrelativistic 
nor the ultrarelativistic approximation is valid. The numerical values show a significant departure 
from the ratio 2:1 between ee” and e~ e* cross sections, expected on the basis of a naive argument 
of indiscernability of the two electrons. 

The predictions of these lowest-order calculations may be amended by radiative corrections (see 
Chap. 7) to allow a comparison with more recent data taken at the extreme relativistic energies of 
the electron-positron storage rings. For square center of mass energies in the range 2 to 9 GeV? 
the agreement between theory and experiment is spectacular, as demonstrated in Fig. 6-18. The 
data are from a group working at the Adone machine in Frascati. 


6-1-4 Scalar Electrodynamics 


As a last illustration of the derivation of Feynman rules, we will now present 
the case of scalar electrodynamics, 1e., electrodynamics of charged spin-zero 
particles. It allows us to compare the predictions of this theory for basic processes 
with those of fermionic quantum electrodynamics, and therefore to understand 
the role of spin in the latter. On theoretical grounds, scalar electrodynamics is an 
interesting case, since its interaction lagrangian contains derivatives and differs 
from the negative of the interaction hamiltonian. 
We start from the lagrangian of a free charged scalar field 


Lo(9) = 6,01 d"p — moto (6-50) 


(caution, no factor 4!) and perform the minimal substitution Op — (0, + ieA, Je. 
Adding the electromagnetic lagrangian ¥%(A) of Eq. (6-25) yields 


L = Lo) + LA)+ Lm (6-514) 
Lin = —ieA"(o'G,,9) + e2 A2o'@ (6-51b) 
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Since the interaction lagrangian contains derivatives, the canonical momenta are 
modified : 


OL 
pe << ne’) -, 40 
T 509") 0g + ieA°C 
ae (6-52) 
t= —_— = 0°@f — ieA%gt 
aon) ° * a 
The hamiltonian reads 
H = Ho + Ho(9, a) + Hin 
0 0 t (6-53) 
Ho(¢, =) + Hin = [es (n@ + at ot — Lo — Lin) 
Here Hp is the free hamiltonian of Eq. (3-76): 
Ho(¢, 2) = [as (x' + O.0' O.p + m? oto) (6-54) 


and Hip, takes the form (the spatial index k takes the values 1, 2, 3) 
Ege ar y= [ax [ieA°(x' pt — ng) + ieA"(pt Og) — e? AX A,9'Q] 
(6-55) 


If we substitute (6-52) into (6-55), we observe that Hin(o) = —Lin(A, 0) — 
e? A°* pt. On the other hand, we may perform a canonical transformation to 
the interaction representation; @, z, A are changed into @in, Zin, Ain, ANd Tin = 
Aoo},. Thus, in this representation 


> 
H int = ieAQ (noi, ad TinPin) aia ieA* oh, On Pin ae e? Ainn At Din Qin 


; = ieAt ot, 0, Pin — e” AZ. of, Qin =} e? Ao? of Pin 
eae ae LigtAin, Pin) ce e? Ap? Oi, Pin P (6-56) 


The hamiltonian and minus the lagrangian differ by a noncovariant term (notice 
the change of sign in front of this term) which seems to jeopardize the covariance 
of the theory. 

Consider, however, a typical Green function of the theory 


Cae - . 265 SR, . 5 X05 V15+--s Vp) 


= (O| Te(x1)*** (<n) 9"(Xn+ 1)°** G*(X2n) (V1) A(yp) [0D 


O| T@in(x1)°** Ain(Yp) exp E {ats Fuld |0> 


- <0| Texp| —j | d*z Hn ({Z) o> 


(6-57) 
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There is a second source of noncovariant terms in the chronological products 
of derivative terms. To end up with covariant results, we expect these two types 
of noncovariant terms to cancel exactly. This is indeed what happens. 


We give a formal proof of this property, since we sidestep the problems connected with the ultra- 
violet divergences of the perturbation expansion. 

Let us look more closely at the noncovariant contractions induced by the derivative terms. In 
the following, the “in” indices will be dropped. We list the relevant propagators 


tes 4k ew heey) 


t(y)l0> = t = —— 6-58 
(0| Te(x)e"(y)|0> = <0] Te(y)e"(x) 07 | eg, en (6-58a) 
<0! T8,0(x) "(y) |0> = 8% <0| Te(x)e*(y) |0> (6-58b) 
<0| TA, 0(x)d,9"(y) |0> = 028% <O| Te(x)—"(y) |0 — iguogvod*(x — y) (6-58c) 


Since the g propagators are the only ones with noncovariant terms, we may consider the field 
A as classical in the following argument and handle all Green functions of » and ’ by introducing 
a generating functional 


Z(j) = <O| Texp —i [ate [#in(y) + i*(y oy) + JO)E"(y)] [> 
= ¢0|T r= [ate -+-d*z, AM(z1) (21) Oy, 9(21) A%(2,)e'(z,) 8, o(2,) 
x exp —i [ae-2s°o%o + e? A” ot + j*@ + jo*)|0> (6-59) 


Only the derivative part of the interaction hamiltonian has been expanded in powers of e, since it gives 
rise to the noncovariant contraction (6-58c). We introduce a further notation: the covariant chrono- 
logical product T coincides with the T product of Eqs. (6-58), but for the last term of (6-58c): 


<0| Té,.(x)8,q1(y)|0> = 278% <O| Te(x)e*(y)|0> (6-60) 


c . ° = - r 
Let us examine the contribution of s noncovariant contractions —ig,og,o6*(x — ). There are (; \ 


r!/(2s)!(r — 2s)! choices of (2s) vertices among r, and (2s)!/2°s! ways of contracting them into s pairs. 
Each noncovariant contraction of two vertices gives 


eA"(x)p(x)[<0| TA,.0'(x)d.e(y)|0> — <0] T2,.e'(x)6, (9) |0>] (— A) A E"() + (Wg) 
= ie’ Ady!(x)y(x)d*(x — y) 
where we have implicitly assumed Wick ordering. Therefore, we obtain 


; Ee elt sacra aii 2 i(ieele s 
Z()) = <n a d*z' AX(z'\o'(z') d,e(2') | ¥ d*z” A&(z")p*(z")(z") 


s=0 s! 


x exp —i [ate(-etoto +e’ Ago'y + jot + j*@)|0> (6-61) 


We see that the noncovariant contractions may be exponentiated and their contribution compensates 
the noncovariant vertex e” Ajg'g. 


We may finally write, with a covariant T product (6-60), 
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Table 6-2 Feynman rules for scalar electrodynamics 


Photon oa (22 = Kok glu? kvkp/u? —\ d*k 
k k?—p?+ie  k? — M? + ie/ (2n)4 
B k i d*k 
ee = k* — m? + ie (2n)* 
k 
mw 
a —ie(p, + pi)(2n)*5*(p — p’ — k) 
p p’ 
K v 
k k’ 
ie? g,(22)*5*(p — p’ —k —k’) 
p p 


<0] T@in(x1)-** Ain(Yp) EXP i [ats £i,,(z)|0> 


G(x1,---, Vp) = (6-62) 
<0| T exp i | d*z Lin(z)|0> 


and a covariant lagrangian 


Lan a icAt ot, 0,,0in =F e? AZ, Qin Qin (6-63) 


The resulting Feynman rules are summarized in Table 6-2. The factor two in the 
second type of vertex (“seagull term”) comes from the fact that the quadratic 
term in A is A? and not A?/2!. Also remember that in scalar electrodynamics 
a symmetry factor 4 arises whenever a pair of photons is contracted between a 
pair of seagull vertices (Fig. 6-19). On the contrary, closed scalar loops being 
oriented give no symmetry factor, as long as only topologically distinct loops 
are taken into account. 


Eo Figure 6-19 A diagram of scalar electrodynamics affected with a symmetry 
factor 1/S = 4. 


286 QUANTUM FIELD THEORY 


ke, er kp €; Kr, €f ki, €; Ky, €f Kj, €; 


Pf DP; Pr P; Pr DP; 


Figure 6-20 Lowest-order diagrams for Compton scattering in scalar electrodynamics. 


We shall now review briefly some basic processes, already studied in fermionic electrodynamics. 


(a) Compton scattering 
With the same notations as in Sec. 5-2-1, we have 


d cea 

pad eT EC a 

aQ 4m? k; 
To lowest order, the three diagrams of Fig. 6-20 contribute, but if we choose polarization vectors 
satisfying 8; p; = &5°p; = 0, only the third one is nonvanishing : 

Ale = A(e,*6;)? 

do at? ky 2 (6-64) 
and —= as *¢;)? 

dQ m C) (er) 
to be compared with the Klein-Nishina formula (5-112). 


(b) Scattering of two identical charges 
We use the same notations and diagrams as in Sec. 6-1-3 and Figs. 6-12 and 6-13. In the center of 
mass frame 


do _ a? 
dQ 16E? 


\7/? 


~ + “D5 . sk ateat 
with inp =a] P2+ Pi°P2 , Pi*p2 t+ Pr 7 


(Pi — pi)? (Pi — p2)? 


a do - a? (2K? =") 2 E? — mn? 6-65 
dQ 4E? \ FE? — m? sin? 9 2E? — sn? oy 


to be compared with Eq. (6-42). 


(c) Pair annihilation into two photons 
We use the same notations as in Sec. 5-2-2 and the same diagrams as in Fig. 6-20: 
do «a m+ E, 
a0 ~ 4 (m+ Ea — pr cos 6p, | 


T/ (6-66) 
with | 7 |? = 4(e,- ,). If final polarizations are not observed 
mre | 23+) 4 
Sot = 7 — = aes ” in(y+ JP — | (6-67) 
il ae i); y? — i] va! 


with y = E2/m and ro = a/m. In the nonrelativistic limit, y + 1 and Stor ~ 2nr3/v2, that is, twice as 
large as the cross section ete” = yy. 


(d) Bremsstrahlung 
The same notations as in Sec. 5-2-4 are used; the diagrams are depicted in Fig. 6-21. We pick a 
polarization vector ¢ such that ¢-k = 0, e° = 0. Then the third diagram does not contribute: 
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—Ze (ie € k, € —Ze k, € =7e 


Figure 6-21 Lowest-order diagrams for bremsstrahlung in scalar electrodynamics. 


Sp= =te*(=Ze) 2n5(p9 +k? — p°) jee a Dy + k°) 2p; e(p? + py — k°) 
(py + k — pi)? (pp + k)? — m? (p; — k)? — m? 
eee Py @ dew dQ, dO (art Pie io) 
raed Pi q* x k é ; k Ip 

Summing over the polarizations 

Z*a3 ps 1 dw 
dO ynpol = (Qn Pi a dQ, dQ 

Payne 2 ear 2 : . ned 
g le py sin® Oy — 45? pi sin* 6; AR} 4 2pips sin 8; sin 8 cos p E, 
(E> — ps cos 85) (E; — pcos 6;) (E; — p; cos 0;)(E¢ — pr cos 65) 


(6-68) 


6-2 DIAGRAMMATICS 


The Born diagrams—or tree diagrams—encountered so far were very simple, 
since they implied no integration over internal momenta. In the general case, 
however, we have to deal with more complicated diagrams. This section will be 
devoted to a survey of the general properties and terminology concerning Feynman 
diagrams. For the sake of simplicity, most of this material will be presented in 
the case of a scalar theory of a single neutral field, except when explicitly stated. 


oe 


6-2-1 Loopwise Expansion 


The loopwise perturbative expansion, i.e., the expansion according to the increasing 
number of independent loops of connected Feynman diagrams, may be identified 
with an expansion in powers of h. By definition, the number of independent 
loops is nothing but the number of independent internal four-momenta in the 
diagrams, when conservation laws at each vertex have been taken into account. 
For a connected diagram, with J internal lines and V vertices, we have V 6* 
functions expressing this conservation, and after extracting the conservation of 
the incoming momenta, we are left with V — 1 constraints. Therefore, the number 
of independent momenta or loops is 


L=I-(V—1) (6-69) 


Notice that L is not the number of faces or closed circuits that may be drawn 
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Figure 6-22 The tetrahedron diagram has only three in- 
dependent loops. 


on the internal lines of the diagram. For instance, the tetrahedron diagram of 
Fig. 6-22 has four closed circuits but only three independent loops. 

To find the connection between L and the power of h, we collect all factors 
h. We leave aside the factor f that gives the mass term a correct dimension. 
In other words, the Klein-Gordon equation should read [@2 + (mc/h)*]o = 0, 
indicating that the mass term is of quantum origin. This phenomenon is dis- 
regarded in the sequel. There are thus two origins of such factors. First the 
commutation (or anticommutation) relations imply a factor h, for example, [ p(x), 
m(y)] = ihd3(x — y), which leads to a factor h in each propagator 

pas d*k ik-(x—y) ih 
ae) = |= kK? —m + ie 

Second, the evolution operator e~‘#*" contains fh explicitly, and so does 
exp [(i/h) J Lin(@in) d*z]. Thus, we have a factor fh for each propagator and h7} 
for each vertex. The total power for a diagram with E external lines is h’*?~" = 
h®-1*" For a fixed numbér of external lines, ie., for a given Green function, 
the announced result follows. 


In units where c = 1, a scalar field g has dimension [¢] ~ (energy/length)! 2, the coupling constant 
A of Lig = Ap* has dimension [f]~* ~ (energy x length)~', and a spinor field has dimension [yw] ~ 
(energy)‘/?/length. This gives the right dimension [/] to the actions 


= [ats|5 cor -3(7) ot + 04] or r= | aeo(a—“)y 


Of course, the lowest-order diagrams in f are the Born diagrams without 
any loop, such as those computed in the preceding section. 

The reader may then wonder why this topological loopwise expansion co- 
incides with the expansion according to powers of the coupling constant, in a 
theory with a single coupling constant. This is because in such a theory there 
exist auxiliary relations between V, the number of vertices (the power of J), and 
L. Take, for example, the g* theory; counting the total number of incident lines 
at each vertex tells us that 


4V=E+2I 
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for a diagram with E external lines. Eliminating I with the help of (6-69), we 
get L— 1+ E/2 = V(E is even). 


6-2-2 Truncated and Proper Diagrams 


We introduce some terminology that will prove useful in the sequel. 
The truncated functions are defined through the multiplication of Green 
functions in momentum space (without the 6* function of total energy momentum) 
by the inverse two-point functions pertaining to each external line: 


The two-point function G'”’ is referred to as the complete propagator. For p? ~ 
m, we have [G'°"\p, —p)]~! ~ (iZ)~'(p? — m2) where Z is the wave function 
renormalization introduced in Chap. 5. Hence, up to powers of Z, the on-shell 
values of these truncated functions are the quantities entering the reduction 
formulas. For instance, the connected part of the matrix element of Eq. (5-28) 
reads 


2 


2 epnoela gine = 7°" GO (— pi... —Dm Gis+++s Om) Ae 


trunc qj=m 


x (2n)*d*(Zp; — qj) 


The perturbative expansion of truncated functions is expressed in terms of trun- 
cated diagrams, 1.e., that have no self-energy part on their external lines. More- 
over, in the Feynman rules, no factor or propagator is attributed to the external 
lines (see Fig. 6-23 for illustration). Finally, if we restore the factors h, as indicated 
in the last subsection, L-loop truncated diagrams get a factor h’~?. 

We finally define proper or one-particle irreducible diagrams. Those are 
truncated connected diagrams which remain connected when an arbitrary internal 
line is cut (see Fig. 6-23). 

The proper functions, defined by their perturbative expansion in terms of 
proper diagrams, are the building blocks of perturbation theory, since the integra- 
tions over internal momenta may be carried out independently in each proper 
subdiagram of a given diagram. For the same reason, they play a central role 


(2) (b) (c) 


Figure 6-23 Examples of a nontruncated diagram (a), of a truncated but not proper diagram (b), and 
of a proper diagram (c). In cases (b) and (c), no factor is ascribed to the external lines. 
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in the renormalization program (see Chap. 8), since it is necessary and sufficient 
to make them finite to get rid of all ultraviolet divergences. 

Beside this topological definition, the proper functions may be defined 
algebraically. As shown by Jona-Lasinio, the generating functional of proper 
vertices is the Legendre transform of the generating functional of connected 
diagrams. The latter, denoted G,(j), has been defined in Chap. 5 as the logarithm 
of the generating functional of all Green’s functions 


ZO | Texp|i [ax ott) [o> = Gli) (6-7 1a) 


We proved in Sec. 6-1 that it corresponds indeed to connected Feynman diagrams 
with 


00 


GAj)= Di = | dtnasedta CG»... <a iam) (6-71b) 


n=1 


We now construct the Legendre transform of G, as follows. Let @,(x, j) be 
the functional of j defined through 


6 

POD = By Gl (6-72) 
and let us assume that the relation ~,(x) = ¢~,(x, j) may be inverted to yield j(x) = 
jc(x, @). This is possible at least as a formal series provided (6G,/5j)|;=0 = 0, 
which means that the one-point function vanishes and [67G,/6j(x)dj(y)]|j=0 # 0. 
This is what we assume in the following. The subscript c in @, aims at reminding 
us that @, is an ordinary c-number function, not to be confused with the quantum 
field o. 

The functional I'(@,) is by definition 


T'(.) = joa aa [as jedo.0o | (6-73) 


Kx) = jA{*,,) 


The factor i has been introduced for later convenience. From (6-73), it follows by 
differentiation with respect to @,(x) that 


0 OGAj) djAy, Pc) 
ia Pod = | atv] OO ¢ | + — if(x, @e 
0@-(x) diy) 2 j= ide) OPc(X) meee! 
The first term of the right-hand side vanishes, owing to (6-72), and 


_ Sg.) 

dQ-(X) 
As is well known in the analogous instances of classical mechanics or thermo- 
dynamics, the Legendre transformation is involutive. We also notice that iI (@,) 


may be regarded as the value of G,(j) — i J d*x j(x)@.(x) (j and ¢, independent) 
at its stationary point in j. 


Je(X; Pc) = (6-74) 
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We want to show that I'(g,) is the generating functional of proper functions 
Teioce; cages 


1 
I'(@,) = » a ars ae A \- aad OO B50) Xn) P(X1) ede P(Xn) (6-75) 


We differentiate Eq. (6-72) with respect to ~,(y) and get 


5 6 ere 
54*(x — y) = aly) Ee GAD ues | 


-, | jtz icles Ge) _9°Gel J) 

dy) dj(z)dj(x) 
y | 2 TG) __ FPG) 
59(Z)OMAyY) 9j(z)dj(x) 
where in the last equation use has been made of Eq. (6-74). Hence, the kernel 
[6°T/d~.(v)d@.(z)] is the inverse of i [67G,/dj(z)dj(x)]| j=; We may now set 
@. = 0, according to the assumption that (6G,/6j)|;=0 = 0. The previous identity 


tells us that the connected two-point function G?)(z — x) = —[6?G,/5j(z)5j(x)]|j=0 
is the inverse (for convolution) of the function 


— iP y —z)= —i[6°T/dg.(y)de-(2)], <6 


From translational invariance I’'?) depends only on the difference y — z and 


J= IAP.) 


(6-76) 


J= iP) 


| d*z Ty — z)G(z — x) : id*(x — y) (6-77) 


In momentum space, this reads 
G(p, — pT (p, — p) = i 
where the Fourier transforms of the I are defined as in Eqs. (6-20) and (6-21). 


Henceforth, we shall use shorter notations for the two-point function Gp) = 
G)(p, — p), V'?(p) = T(p, — p). If we write the former as 


2p) = ioe 6-78 
where Z(p) is called the self-energy, we have 
Tp) = i[G(p)]~* = p? — m? — X(p) (6-79) 


' We may now a a (6-78) as (see & ig. 6-24) 


G%p) = 
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© — -@:-6o0-- 


4 ee >= — + = : a ae 
= ‘ae a ip A =e eee we - 


Figure 6-24 Graphical representation of Eq. (6-79). The white blob represents the complete propagator 
G'(p), the shaded one the proper self-energy —iZ. 


which means that —i is the one-particle irreducible perturbative contribution 
to the two-point function. We obtain iI?) by addition of a zeroth-order 
contribution i(p? — m?). 

The interpretation of higher functions as one-particle irreducible (proper) 
functions may be derived by successive differentiations at g, = 0 of the master 
identity (6-76). For instance, after one more differentiation with respect to @,(u), 
we get (see Fig. 6-25) 


0=i | ate T(z, y, u)G?(z, x) + [ats d*y T@(z, y)GO'(x, z, v) (vn, u) 
and, using (6-77), 
Rees y> z) aa [are d*y' d*z' C20. Me Ze Xe (y's vis (29 z) 


= —iG®) (x, y, z) (6-80) 


trunc 


Thus il is the truncated function (or one-particle irreducible; there is no 
distinction for the three-point function). The reader may pursue this process and 
be convinced that the higher iI” are indeed the one-particle irreducible functions. 


+ crossed 
diagrams 


u 
+ 


Figure 6-25 Graphical representation of Eq. (6-80) and ofa similar identity for the four-point function. 
As in Fig. 6-24, the white blobs represent connected functions G™, the shaded ones proper functions 
ir”, 
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It may be simpler to rewrite T(g.) = 4 J d*x d+y P(x — y)e.(x) ey) + F(@,) and to solve Eq. (6-74), 


together with (6-72) as 
dG Oo # 6G 
— | d*x Ty — x) —* =j +o t( =) 
| ane * ten! \° 


" Using Eq. (6-77), we get 


i eC (2) ] 
- i {a vy G(x = y)| jy) + Soy) \ 15) (6-81) 


This identity appears most clearly in the pictorial representation of Fig. 6-26. Any connected diagram 
either contributes to the two-point function G{?) or has a tree structure made of the generalized 
vertices I“”(n > 3) connected with complete propagators G{?’. On the other hand, we know that any 
connected diagram may be decomposed into one-particle irreducible subdiagrams. To identify the 
latter with the iI”, it remains to see that all diagrams appear with a correct counting factor. 
This is just a matter of inspection. 

The interpretation of these remarkable identities will be pursued in Chap. 9, where we shall 
develop the method of functional integration. We also mention that further Legendre transformations 
might be used to define two-particle irreducible kernels, etc. 


From this discussion, it must be clear that, to lowest order (tree diagrams), 
T'(g,) reduces to 


1 
eae [atx £[9.(x)] (6-82) 
The h factor has been restored according to the rule derived above. Indeed, 
the proper two-point function is given to lowest order by 


1 ( d*p, d* ; 
FP ex, y) = ; | om = eilpi: xt Pr: »(2n)*5*(p1 ai P2)(pt pa m?) 


and thus contributes 


~ 4 | d*x d*y T?(x, y)p(x)@c(y) = 4 [asx [(d~.)* — mz] 


J 
° 1] ] 
5G.) _ «* i 5 ame x j 
i5j(x) (n-1)! i 
j 
x x 
= Sey + 0-0 fe ee 
Ge) G2) pa) Gap 


Figure 6-26 Diagrammatic representation of the identity (6-81). The conventions are the same as on 
Figs. 6-24 and 6-25; the black blob represents the generating functional 6G,(j)/idj(x). 
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to I'(@,), whereas the higher T'”, to the same order h~', reduce to the original 
vertices of the lagrangian. 

The importance of the functional I'(@) stems from the fact that it generalizes 
to higher orders the classical action; I'(@) is often referred to as the effective 
action. It plays an important role in the discussion of the stable fundamental 
field configurations, for instance in the discussion of spontaneous symmetry 
breaking, as will be seen in Chap. 11. 

The previous considerations may be extended without difficulty to several 
species of scalar fields, or to fields with a nonzero spin. In the case of half-integer 
spin, we have to use anticommuting sources (as in Sec. 4-3), and the “classical” 
field w.—the analog of g,—also belongs to such an anticommuting algebra. Some 
care is needed in the manipulation of these anticommuting functions, but the 
Legendre transformation still yields the generating functional of proper vertices. 


6-2-3 Parametric Representation 


We shall now consider an arbitrary proper diagram G in a scalar theory and 
compute the corresponding contribution I(G) as given by Feynman rules. We 
assume that G has no tadpole, ie., has no internal line starting and ending at 
the same vertex. As before, J and V denote the number of internal lines and 
vertices, respectively. It is convenient to give an arbitrary orientation to each 
internal line. Define the incidence matrix {¢,,}, with indices running over vertices 
and internal lines respectively, as 


1 if the vertex v is the starting point of the line | 
Ey = 4 —1 if the vertex v is the endpoint of the line | 
0 if ] is not incident on v 


This V x I matrix characterizes the topology of the diagram. We introduce the 
following definitions. A subdiagram G’ of G is a subset of vertices and lines of G, 
such that no vertex is isolated; we do not assume that, given two vertices of G’, 
all lines connecting them in G belong to G’. A tree on G will be a subdiagram 
containing all vertices of G but no loop. G may be reconstructed from one of its 
trees by addition of lines. From Eq. (6-69), we know that a tree has V — 1 internal 
lines. Its incidence matrix, a V x (V — 1) matrix, has a rank less or equal to 
V—1. Let us show that the rank is indeed V — 1. If we discard an arbitrary 
vertex of the tree, then there exists a unique one-to-one correspondence between 
the lines 1,,...,/,-, and the remaining vertices v,,...,vy—,, such that Ev,i, # 0. 
For any other correspondence, one ¢,,, at least vanishes. This means that the 
corresponding (V — 1) x (V— 1) minor of the incidence matrix equals +1. This 
holds true for any such minor, and thus the incidence matrix of a tree diagram 
has rank V — 1. 

For an arbitrary connected diagram, the condition L= I + 1 — V> 0 implies 
that the V x I incidence matrix has a rank p less or equal to V. Since Mea 
for any != 1,...,1, p < V—1, and since ¢ is obtained by the addition of further 
columns to the rank V — 1 matrix of a tree subdiagram, p = V — 1. 
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Let us now consider G as a Feynman diagram contributing to some proper 
Green function I'(p;,...,p,). The p are incoming momenta, Yi pi= 0, and we 
denote by P, =) p,, the sum of incoming momenta at the vertex v; clearly, 
yy_, P, = 0. The contribution I¢(P) of G to if (p) = (27)*5*(>’ pi (p;) depends 
‘ only on the P, provided of course that we consider a theory without derivative 
couplings. This quantity has the form 


I¢(P) = x (22)*54*(D’ P)Ig(P 


C(G ] ails i z 

Fai | TI Gaye any, Liane" (R ~ et te 
Here S(G) is the symmetry factor of the diagram and C(G) stands for all factors 
pertaining to the vertices: for instance, C(G) =(—iA)” in the A(o*/4!) theory. 
The four-momentum k, has been oriented according to the orientation chosen 
along the /th internal line. The theory under study might involve various species 
of scalar fields—hence the subscripts on the masses m,. In a scalar theory with 
derivative couplings or if nonzero spin fields are introduced, polynomials in k 
would appear in the numerator of the integrand of (6-83). This is only a minor 
complication and will be illustrated in practical instances in the subsequent 
chapters. Next we use the following integral representations of the free scalar 
propagator and of the 5* function: 


i 
p? —m? + ie 


(2n)*54 ( > ok ~ [ay exp |- iy. le =) ca) (6-84b) 
l iI 


In (6-84a), the integral converges at the upper limit owing to the presence 
of ie. This ie will be omitted in the sequel (m? may be regarded as having a 
small imaginary part). 

We insert the representations (6-84) into (6-83) and boldly interchange the 
order of integrations. This is illegitimate if the integrals are not absolutely con- 
vergent, which occurs frequently, but will be justified in Chap. 8 by the processes 
of regularization and renormalization. The integrations over each four-momentum 
k, may be carried out easily 


4 
(< a exp E (# +2 E—— 2 Yp* el) 
4 2 1 2 
- (os x exp ji (. oF om vo) =a 4oz (r Je) | 
: 2 
= AA*? exp |- z(s su | 


2 \ da. cist? —m? +c) (6-84a) 
0 
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where A stands for the Fresnel integral 


AA* = 
(4700)? 

Therefore 

(27)*6*()° P)Ic(P) 

2 
‘, —iexp |= ame + (5 tu) i | 
= 4 eee NO Al | anaes 
ne ng | Lie (Arca, aa: 


(6-85) 
The integration variables y are then reshuffled by the change of variables 
W= 244+ 2Zy 


y2 = 22 + Zy 


Yv-1 = Zy-1 + Zy 
yv = 2y 


the jacobian of which is one. Since ’, ¢,, = 0 (the line | connects two and only 
two vertices), zy appears only in the last exponent of (6-85). Therefore, integration 
over zy yields the 5* function of energy momentum conservation, leading to 


Ig(P) = 


V=1 74 
V-1 00 = exp {=i ame 1 ( be stu) ia on 
v=1 


Ce 
0 l (41c«;)7 ia 


The (V — 1) x (V — 1) matrix d¢(a) defined as 
1 
[de(a) |... = » Ey 41 =a Ey 
I a) 
may be shown to be nonsingular, and its determinant reads 


1 
Ag(«) = det [de(x)] = ¥ [] — 
GF le F 
where the sum runs over all trees of G. Since every tree of G has V — 1 lines, 
Ag is clearly a homogeneous polynomial in the «; 1. For «>0, A is positive. 
The remaining (V — 1) four-vectors z may then be integrated over and the result is 
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V-1 
; exp} y Pa [dé "allo? 
takP) = | 8 (doy e~ imi) - “Thar ere $y a Nola]? 


This formula expresses J¢(P) as a function of the invariant scalar products of 
external momenta P, - P,,. The functions of « that appear in the exponent or in 
the denominator may be given a form independent of the choice of the vertex V. 
On the one hand, the denominator 


Pe(a) = ay -*: Aca) => [] a ie (6-86) 
Fl¢ F 
is a homogeneous polynomial of degree L clearly independent of V. On the 
other hand, the quadratic form 


|= ik 


Oc(P, a) = a Pee lidenediinne,. 


may be expressed as 


Y se [] « (6-87) 


where the sum runs over all possible “cuts” C of L+1 lines that divide the 
diagram into two and exactly two connected parts G,(C), G2(C). Such partitions 
are obtained from the trees of G by cutting an (L+ 1)th line. Then sc denotes 
the square of the momenta P entering G,(C) or G2(C): 


#4 2 
oe ( x r.) = ( z P,) (6-88) 
veG\(C) veG,(C) 


Therefore Qc is a ratio of two homogeneous polynomials of degrees L + 1 and 
L respectively. 
The final parametric form reads 


eiQc(P,a) 
[i(4n)?}"[Pe(@)]? 
with Ag and Q¢ given in Eqs. (6-86) to (6-88) and we recall that L=J+1—-V. 


The proof of the results (6-86) to (6-88) and detailed properties of the functions 
#¢, Qc skipped in this elementary presentation may be found in the references. 


I¢(P) = \; ; (da, e~ im?) (6-89) 
ou 


Let us illustrate the rules (6-86) to (6-88) on an explicit example. The diagram of Fig. 6-27 contributes 
to the four-point function in the g* theory. We recall that the symmetry factor is S(G) =) atic 
C(G) = (—iA)*. The trees J are (1,12), (11/3), (14), (2/3), (lol4). Hence the polynomial Y is equal to 


Pe = 3h4 + O24 + A203 + 104 + 103 
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P, 


Figure 6-27 Sample diagram contributing to the four-point 
P, function. 


We now list the cuts C and the corresponding sc: 
(lylals) sc = P3 = (P2 + Py 
(ill)  sc= P? = (P2 + Ps 
(IxIgl4) Sc = P3 = (Py + Ps)’ 
(lalsla) Sc = P} = (P; + Pa)’ 
Choosing P, and P, as independent variables, we have 


Oc(P, «) = Pe '[Pia,a2(a3 + 4) + Pharesee + (Pi + P2)? 203%] 


We may use the homogeneity properties of A, and Q;: 
Qc(P, Ax) = AQc(P, x) 
. P(Aa) = A* Pe(a) 
to further simplify Eq. (6-89). We insert in the integrand the identity 


foe) I 
0 i 
and rescale a; > Aw. Then 


1 ae O(1— > om 
I¢(P) — EC oak |, [].¢en da near (73 a p- an exp iH [ontr.9 ) ape mia |} 
(6-90) 


The integration over the « has been reduced to the interval (0, 1) because of the 
6 function. In the A integration, the convergence at the upper limit does not 
give any trouble, owing to the ie implicit in m?. The integral converges at the 
lower end provided 2V — I — 2 > 0. In terms of Euler’s gamma function, 


yg 
j3V-1)-20 eer a(1 i y a) 
jo) =" y-Fev-1-2)| [| de: _ 


(41) 01 [Pe(a)}?| Oc(P, «) — y an 2V—-I-2 


(6-91) 
The convergence condition, rewritten as 2L — 4V + 4 < 0, just expresses that I¢(P) 
has a negative dimension in an energy scale. This may be read on Eq. (6-83) or on 
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Eq. (6-91), since Q¢ has dimension [E]*. When this dimension is nonnegative, 
the integral diverges at the lower limit, 4 ~ 0, which is of course the translation 
in this parametric representation of the ultraviolet divergence at large k of the 
integral (6-83). This relation between dimension and occurrence of divergences 


‘foreshadows the power counting that will be our touchstone in the renormalization 
procedure. 


The parametric representations (6-89) to (6-91) are interesting and useful in 
many respects. They provide an explicit form which may (or may not!) be con- 
venient in practical computations. They may also be used to study analyticity 
properties of Green functions or to carry out the renormalization program. 
Finally, they exhibit fruitful analogies with the theory of electric circuits. 


6-2-4 Euclidean Green Functions 


The Green functions of a (scalar) theory are functions of the invariant scalar products of their 
external momenta, which are real Lorentz four-vectors. As we have seen at the end of the preceding 
chapter, and as we shall see in the next section, these functions enjoy analyticity properties and may 
be continued to unphysical regions. Here, we dwell on the continuation to the euclidean region 
where field theory presents interesting analogies with statistical mechanical problems. We shall not 
formulate this euclidean theory ab initio, but rather derive its perturbative rules. Again, for simplicity, 
we deal with a scalar theory. 

Consider a proper function I"'(p,,..., pn) and assume that its arguments satisfy the condition 


n 2 
( Yu nm) <0 (euclidean region) (6-92) 
k 


=H! 


for any real (u,). The manifold satisfying this condition is a linear space-like subspace (a hyperspace) 
of momentum space. If we consider P as a function of the invariants p;- p; (i,j = 1,...,.2- 1) let us 
compute the dimension of the total manifold in the absence of this constraint, and the one of the 
submanifold (6-92). We must, of course, remember that in a four-dimensional space, any set of more 
than four vectors is linearly dependent. Hence for n > 6, the p are not independent. Consequently, 
the invariants p,- p; belong to a manifold of dimension 4n — 10 for n > 4 (1 and 3 for n = 2, 3 respec- 
tively). If condition (6-92) is taken into account this restricts them to a submanifold of dimension 
3n — 6 for n>4 (1 and 3 for n=2, 3). For n=4, both have dimension 6 (for instance, s, ¢, u, 
pi,-.-, p32, with  p? = s + t + u), whereas for n > 5, the euclidean submanifold (6-92) has a dimension 
less than that of the whole space. 

In the euclidean domain (6-92), all the p are orthogonal to some time-like vector n. After a 
Lorentz transformation, we may choose n = (1, 0,0,0) and thus p? = 0 for all i. This enables us to 
associate to each p; a vector p; of a euclidean R* space: in that frame p? = p? = O and p; = p;. Then 


Bit By = op? + pip} + p2bF + Bip; = — Pi Ds 


If we now pick a diagram contributing to I” and express its contribution according to Eq. 
(6-89), we may define P,, the sum of the p; entering the vertex v, and to each “cut” C of the diagram 
we associate [compare (6-87) and (6-88)] 


2 Z 
i= ( y a) = -( Y p) = ~s 
veG,(C) veG,(C) 


QOc(P, a) = aa 4 sc [] a, = —Qe(P, a) 
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ImA 


Rear 


(a) (b) 


Figure 6-28 The Wick rotation in parametric space (a) or in momentum space (4). The initial contour 
of integration is indicated by a single arrow, the final one by a double arrow. On (5), the crosses 
® stand for the position of the poles k? = +[(k, + Zp)? + m? — ie]"/. 


Therefore, on the manifold (6-92), we may rewrite (6-90) as 


es © dA exp {—iA[Q¢(P, a) + } um? ]} 
= = =e pig 2b) 2a a a 
oe | ( 1 7) a Se I A [i(4n)?}*[Po(x1]? 


We recall that the m? have implicitly a negative imaginary part. This, together with the positivity 
properties of the bracket of the exponent (for m7? > 0), allows us to rotate by —7z 2 (i.e., clockwise) 
the integration contour in the complex 4 plane (Fig. 6-28). This is, of course. equivalent to a simul- 
taneous (— 7/2) rotation of all the variables « in Eq. (6-89). It must be stressed that the ie prescription 
has played a crucial role in telling us which rotation was allowed and that the Wick rotation, as it is 
usually called, is illegitimate if some m? happen to be negative. In momentum space [compare with 
(6-83)], the Wick rotation amounts to a 2/2 rotation (counterclockwise) of all kP in the frame where 
all p? = 0. This is in agreement with the position of the poles of the propagator, located at 


k? = +[(k + Dip)’ + m? — ie)]*? 


After this rotation, we get in the euclidean region 


wo f Dp 2, 
Wis a ad exp —[Qc(P, «) + um? ] 
I¢(P) = (-—i)"~ *Ie(P) = (—-3) ft | eee — eer 


The factor C(G) in front of I¢(P) [compare with (6-83)] contains i” coming from the expansion of 
exp [i §dtz Fin({Z)| to the Vth order. We set C(G) = i” C(G) and rewrite (6-93) as 


C(G) CiG). 2 
S(G) S(G) 


(6-93) 


For instance, in the Ay* theory, C(G) = (— A)”. We recall from the analysis of Sec. 6-2-2 that the 
proper functions have been identified with the iT” defined through the Legendre transformation 
(6-73) and (6-75). Therefore, thanks to this last factor i, these functions I coincide with the real, 
euclidean Green functions P™: 


r(p,, trey Pr) a P@,, ae) Pn) 


defined as sums of the contribution [C(G)/S(G)] I¢(P) of each diagram. It is easy to see, by repeating 
backward the same computation as in Sec. 6-2-3, that 
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‘ Cer. Cie) d*k, 1 
Drow =—— |T] i ) 5 ie 
(2x (>: p) sS@ Ig(P) s@ | + Qn ie ee II (2n)*54 @ on 2, eats) (6-94) 


where now k, are euclidean four-momenta, k? = ee. (ki)?. Here, as in Eq. (6-93), the integrand 


enjOys positivity properties. Of course, Eq. (6-94) means that [ may be computed by euclidean 
‘ Feynman rules: 


Vertex -A instead of —iA 


(6-95) 


1 
Propagator = instead of ———__—_ 
k? + m? k? — m? + ie 


From these rules, we learn that connected Green functions (with external propagators) are related by 
GE(D1,-- +5 Pn) = i(—i"GL(D1,..., Br) (6-96) 


in the coincidence region (6-92). 
In configuration space, these functions read 


- n d* — . 
G{X1,...,X_) = \n oat e” '% **(27)*54() Gi) Ge(G1,°**, Gn) 
1 
with x°q = x°q° + x'g' + x?q? + x3q°. We then introduce the generating functional 
aes 1 : ae: 
G.(j) = y: at dx,°°- dXn j(X1)° i * (Xn) Ge(Xi, Si 1 Xn) 


and a similar expression for I'(,). Collecting all factors i in Eqs. (6-71b), (6-72), (6-73), and (6-96), 
it is then straightforward to check that I'(g,) is the Legendre transform of G,: 


P(e.) = kz = [ax je0.00 (6-97) 
where ¢, and j are related by 
5G. j) 
He) 6-98 
oa 6j(x) pina 


The lowest-order euclidean functional in the g* theory is 


es it = m? A 
Pree(@.) = — fats #;[¢(x)] = — [as (; YD GPO. Pe + ae ar ot) 
k=0 : 
Notice the change in sign of the (@9@)* term as compared to (6-82). 
The euclidean theory may be considered as a statistical mechanical theory, the weight of each 
field configuration being exp [ — Sdtz £;(~)]. This will appear more transparent after the introduction 


of functional integrals (Chap. 9). 
As an exercise, the reader may discuss the meaning of the Wick rotation when spin 3 fields 


are present, namely, what happens to the ) matrices and what is the relation between w and yw. 
He or she may then write the Feynman rules for euclidean quantum electrodynamics. 


6-3 ANALYTICITY PROPERTIES 


In the last section of Chap. 5, analyticity properties of amplitudes were derived 
from the general principle of local causality. As a typical example, the two-body 
forward elastic scattering amplitude has been proved to be analytic in the energy 
variable, under suitable conditions on the values of the masses of the particles. The 
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analyticity domain is a cut plane with two branch points at s= M% and u= 
2(m2 + mz) — s = M2 (at t = 0), corresponding to the lowest intermediate states 
in the direct (s) and crossed (u) channels. The discontinuities across the cuts 
were related through the optical theorem to the total cross section of the channel. 
Therefore, we expected that the amplitude will have singularities along the real 
axis at all values of the energy corresponding to the opening of a new threshold, 
ie., the onset of a new possible final state. For instance, the nN + 2N forward 
scattering amplitude is expected to have singularities at s (or u) = (my + nae, 
(my + 2m,)’, etc. 

In the present section, we shall present a sketchy discussion of analyticity 
properties of Feynman integrals. The interest of such a study is threefold. First, 
it is important to test in the framework of perturbation theory the general 
analyticity properties derived rigorously. Whenever we are able to prove from 
general axioms the existence of an analyticity domain, the contribution of an 
arbitrary diagram must go through the successive steps of the proof and must 
therefore enjoy these analyticity properties, or rather their equivalent perturbative 
expression. However, when the masses are such that the general proof fails, the 
study of individual diagrams may prove useful. Second, such a study allows to 
search for complex singularities or to study analyticity in several variables. Even 
though we suspect that the perturbation series does not converge and therefore 
that the exact amplitudes may have different properties from those of individual 
diagrams, this investigation can be suggestive of fruitful conjectures. Finally, 
dispersion relations in one or several variables may be a useful device for the 
computation of Feynman amplitudes. 

We shall frequently refer to the physical sheet of a scattering amplitude. 
We have in mind the region reached by analytic continuation, starting from 
threshold and using the Feynman ie prescription. In this process we do not con- 
sider the possibility of crossing the various cuts arising from the singularity 
structure of the amplitude. 


6-3-1 Landau Equations 


Two types of singularities arise in a function defined by an integral 


re =| ax f(x, 2) 
c 


along a contour C, for instance, an interval (a, b) of the real axis. The function 
f(x, z) is assumed to be analytic in the variables x, z, but for singularities located 
at x = x,(z). Clearly F(z) is analytic for any value of z such that there exists an 
open neighborhood of C free of singularity. If some singularity x,(z) approaches 
the contour C, and if the contour may be deformed to avoid it, then F remains 
analytic. Therefore, singularities of F as a function of z are expected in two cases 
where the contour C may no longer be deformed: 
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Figure 6-29 Singularity arising from the pinching of the 
Poe contour between two singularities x,(z) and x.(z) of the 
X2(z) integrand. 


1. One of the singularities x,(z) approaches one of the endpoints of the contour: 
as Z — Zo, X,(z)-> a or b; Zo is called an endpoint singularity. 

2. The contour C is pinched between two singularities x,(z) and X2(z) (Fig. 6-29): 
aS Z— Zo, X;(z) and x2(z) approach the contour from below and from above; 
Zo is a pinch singularity. 


Elementary examples are provided by the following integrals: 


F(z) = _— = = eee ian ts 6-99 
z aie a +Z We Jz (6- a) 
oe ot ee 

ra |) emia" | : (6-996) 


— i dx ae 
rev |) ae =H ie (6-99c) 


In the first example, = = 0 is a pinch singularity. As z + 0,, the contour is trapped between the two 
poles x = i,, 2 and x = —i,'z. In the second case, z = 0 and z = 1 are endpoint singularities. The 
pinch singularity at z = 2 appears in all Riemann sheets of the logarithm, except for the first one 


(corresponding to the principal determination). Indeed in the first sheet, the contour of integration 
does not cross the point x = 2. However, going to further sheets obliges us to deform the contour 
of integration. Pinching may now happen. The same phenomenon occurs in the third example (6-99c). 
Here the singularity at z = 1 is an endpoint singularity while the appearance of the singularity z = 0 
in every sheet except the first one comes from a pinching at infinity. This is clear if we change the 
tegration variable x into u = 1/x. 

This discussion must be extended to functions of several complex variables 
x; and z;, F(z;) = Ju([ |i dx) f(xi, z;). The boundaries of the integration domain, 
the hypercontour H, are a by a set of analytic relations (x, z) = 0. 
The singularities of the integrand f(x;,z;) take place on analytic manifolds 
SF,(x, Z) = 0. Singularities occur when the hypercontour H is pinched between two 
or more surfaces of singularities or when a surface of singularity meets a boundary 
surface. More precisely, it may be shown that a necessary condition of singularity 
is that there exists a set of complex parameters /,, J, not all equal to zero such 


thamat px), 272) : 


AAU, 2°) = 0 for all s 
A, A(x, 2°9)=0  forallr (6-100) 


ae abe Ae Fr(% z) + 2 As F(X, a | 


=) for alli 


x9,z9 
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The last condition expresses that the hypersurfaces are tangent at the pinching 
point. This is only a necessary condition. Determining whether the hyper- 
contour is really pinched requires a detailed study. 

We apply these general results to the case of Feynman integrals, first expressed 
in minkovskian momentum space. We consider 


dive | To (6-101) 
ol P) = I (Qat)* aan k? — me F ie 

The notations are the same as in Eq. (6-83), but all internal momenta k; have 
been expressed in terms of a set of loop variables q, and of the external momenta 
P. The boundaries of the integration domain are at infinity. In this elementary 
discussion, we shall disregard the possible occurrence of endpoint singularities at 
infinity and introduce no term Y,. The singularities of the integrand are given 
by the equations Y; = k? — m? = 0. The Landau equations are nothing but the 
expression of the necessary conditions -— in this case: 


Ak? — m?) = for all i= 1,...,] (6-102a) 
0k; 
> Ak: =0 L=sieeee, , (6-102b) 
i 04a 
The second equation may be rewritten 
* YS (+)4ki = 0 (6-103) 
ie YL, 


using the fact that those k that depend linearly (and with a coefficient +1) on 
the loop variable q, lie along a single loop, denoted by %. The interpretation 
of the first equation (6-102a) is that singularities occur only when, for every 
internal line, either the four-momentum is on its mass shell k? = m? or the 
parameter A; vanishes. In the latter case, the ith line never appears in the singularity 
equations. The singularity is the one of a reduced diagram where the ith line 
has been contracted to a point. 


It is instructive to see how the Landau equations are derived in other integral representations. Before 
considering the parametric form studied in Sec. 6-2-3, we introduce a mixed representation 


1 L a*q, of (1 = Ya) 

I Pp = A i i ” 

lei al {ul i. hs G se 
[5 ai(k? — mi) 
i=1 
obtained from Eq. (6-101) by means of the identity 
1 day +++ dard(1 — ¥ a) 

= (I —1)! Oo = 

eer (P= 1) | : : (6-105) 
i=1 


Now the singularities of Ic(P) arise from the zeros of the denominator ¥ = Y\ «i(k? — m?) pinching 


the hypercontour in (a, q) space or meeting the boundaries Y, = o%; = 0. The singularity equations 
read 
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A Si (6-106a) 
im; = 0 HES Tenonll (6-106) 
a (xv +>} ia) = (i) a iE (6-106 
Bai j Roaae -106c) 
fs) 
a (xs + > un = 0) jase] (6-106d) 


The case J’ = 0 leads to 4; = 0 for all i. We discard this trivial solution and assume Y = 0. Then 
Eq. (6-106c) gives 


while Eq. (6-106b, d) lead to 
aj(k? —m3)=0 j=1,...,1 


We recover the previous equations (6-102) with the 4 replaced by the a. 

The parametric representation (6-91), valid when the diagram is superficially ultraviolet finite 
[that is, 2L — 4(V — 1) < 0], is obtained from Eq. (6-104) by integration over the loop momenta q. 
Disregarding possible singularities coming from the zeros of P(«), we take Y = Oc(P, a) — ye , um, 
SF, = a, and write singularity conditions as 


ea) 
Aa; = 0 i=1,...,J (6-107) 


Ae A; j 
aa. us gly =() i) = loooep ! 


Again 4’ = 0 is a trivial solution. The conditions may be rewritten as «;0.//da; = 0 for each j, which 
implies Y = )';a;0Y/éx; = 0, owing to the homogeneity of . To show the equivalence with the 
Landau equations (6-102) we need to reintroduce a momentum variable k; for each internal line, 
defined in terms of the external momenta and of the « parameters. They are chosen to satisfy an 
equation of the type (6-103) where 4; is replaced by a,. With this definition the condition «,0S/da; = 
0 boils down to Eq. (6-102a). 

In the search for solutions to the systems (6-106) or (6-107), the condition > a; = 1 may be 
omitted, because of the homogeneity property of the function /. 


In either representation, the solution corresponding to A; (or a) 4 0 for all i 
is referred to as the leading singularity, while a solution where A; (or «;)=0 
for ic ¥ is called a nonleading singularity. In the associated reduced diagram 
R = G/¥, where all lines ic .¥ have been contracted to a point, this singularity 
is a leading singularity. 

The interest of the Landau equations is to cast the problem of determining 
the location of singularities in an algebraic form. However, finding the general 
solution of these equations, even for simple diagrams, is an arduous task. We shall 
pursue this program only in the case of real singularities. 


6-3-2 Real Singularities 


Real singularities are those occurring for real values of the invariants s;; = P;- P; 
on the physical sheet. Notice that these real values of the invariants do not 
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necessarily correspond to a physically possible kinematical configuration. For 
instance, the region s = (Pa + Po)” < (ma + mp)” is a real but nonphysical interval 
for elastic scattering of two on-shell particles of masses ma, Mp. 

For real values of the invariants s;; and a nonvanishing imaginary part (— ie) 
given to the internal masses, no singularity appears along the real contour of 
integration in parametric space. Therefore, singularities appear only in the limit 
e—0. 

It may be shown that any real solution of the Landau equations corresponds 
to a pinching of the integration contour as ¢ > 0 and thus gives rise to a singularity 
of the integral. 


To see this we use the parametric form (6-91) of the integral. Consider first a leading singularity, 
occurring at some real value s = s‘°’ of the invariants. The Landau equations ¢'7 ¢z, @ 0 have a real 
solution a; = a{® (j = 1,..., J). In the vicinity of this point, 


4) OF 
a i oy | Aa; Aa; + O(Az?) 
2 i,j=1 00:80; | OC 


After diagonalization of this quadratic form 
AS =% yd G i: AB? 


with real eigenvalues o;, we see that in any variable f, (i= 1...., I — 1), the solutions of AY — ie = 0 
approach the real axis from both sides, except maybe for exceptional configurations where o, = 0. 
Therefore, there is a pinch singularity in every successive f; integration. The case of a nonleading 
singularity is easily disposed of. Suppose that n parameters x, vanish at the solution of the Landau 
equations: a{° = 0, ie %. After integration over those parameters in a neighborhood of the origin, 
we are back to the first case. 

The same method may be used to study the type of singularity. Suppose that we are near the 
singularity, s ~ s, and consider the contribution to the integral from a neighborhood of a = a. 
We write 


1 aS 
OS. ie F 00%; 2 ijeR Ou; 00; 
i i 


Ax Ax; 


where the first term comes from the variation As, = s, — s\°’ of the invariants, the second one from 
the n parameters (i¢.%) near zero, and the third from the parameters of the reduced diagram R. 
As the a;, i¢ .%, vanish, the denominator ?,(«) in Eq. (6-91) vanishes as a power L(.%), where 
L(¥) is the number of independent loops of the subdiagram associated to .%. A simple counting 
argument yields the following behavior for the integral J¢(P) as a function of the number of loops and 
internal lines of the reduced diagram 


I¢(P) ~ (As) 
- (6-108) 
where k = (2L— 1) + [n— 2L(9)] + 37 — 1 — n) = 4[4L(R) — 1(R) - 1] 
Whenever k is a nonnegative integer, this behavior may be modified by logarithms 
Te(P),_ yo ~ (As) In™{As| k=0,1,... (6-109) 


This result is actually independent of the singularity being real. We conclude that, typically, Feynman 
integrals have logarithmic or square root branch points as functions of the invariant scalar products 
sij = P; : ie 
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Rok Figure 6-30 The bubble diagram. 


Interesting properties may be proved concerning real singularities. In para- 
metric space, at the solution of the Landau equations, the derivatives OS /00,; 
vanish for all nonzero a\°). Therefore, such a solution corresponds either to a 
local extremum or to a saddle-point of Y on the compact set A = {0 <q <1, 
> a; = 1}. On the one hand, it may be shown that it cannot be a minimum. 
For instance, the self-energy diagram of Fig. 6-30 is singular at P? = (m, + m2). 
For this diagram the function reads 


0102 P? 
Oy + X2 


Cf == 


— ami — am} 


and for P* = (m, +m)? has a maximum zero on A at a, = m2/(m, + m2), a2 = 
m,/(m, + m2), the solution to the Landau equations. On the other hand, saddle- 
point configurations do not seem to appear for Feynman integrals (although no 
proof of this point is known to the authors). Consequently, we restrict ourselves 
to solutions corresponding to local maxima of Y, called thresholds, where both 
real and imaginary parts of the amplitude are singular. The imaginary part gets a 
new additive contribution above the threshold, as we shall see, while the higher- 
order derivatives of the real part are divergent. 

A further classification of thresholds appears useful. Normal thresholds are 
those singularities whose occurrence is expected from unitarity as recalled in the 
introduction of this section. Let us introduce the following definition. A set of 
/ internal lines to G is said to be an intermediate state of G if cutting /— 1 
lines among them leaves the diagram connected, while a further cut disconnects 
it into two parts G,, G2, each having incident external momenta. Notice that, in 
contrast with the concept of cut C introduced in Eq. (6-87), G; and G2 may still 
contain loops. As in Eq. (6-88), we define (see Fig. 6-31) 


2 2 
=e r,) 5 r) 
veG, veG, 
aaa, 
= Cuo hl — a 
cutee 


Figure 6-31 A diagram with an intermediate state of masses 
m,,..-,m, and the corresponding reduced diagram. 
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Then 
s= sO = (Sm)? (6-110) 


intermediate 
state 
is a normal threshold of the amplitude. Physically, s‘° is the smallest value of the 
invariant s such that the physical state of masses m,,...,m) can be created. 


Let us show that such an intermediate state does correspond to a solution of Landau’s equations. 
We equate to zero all x pertaining to lines that do not belong to the intermediate state, say 
Oj+1,---, 7. Then for the reduced diagram 


Hy 
S(04,...50%, 0,...,0) = ———— ~ 2m am? 


= 1 Oj 


The equations 0Y/éa; = 0 have a solution at «fs =) mi, x =m, ' () m;'), and this is easily seen 
to be a zero and a maximum of Y. 


Contrary to naive expectations, there exist other real singularities, called 
anomalous thresholds. These singularities are somehow troublesome since their 
contribution to the absorptive part of the amplitude cannot be directly related 
through unitarity to physical processes. The condition for the existence of 
anomalous thresholds in a scattering amplitude corresponds to a situation where 
the axiomatic derivation of dispersion relations is no longer valid. It is therefore 
important to control the possible occurrence of these new singularities. This is 
a difficult problem and we shall only illustrate this phenomenon on simple 
examples in the next subsection. 


In momentum space, the distinction between normal and anomalous thresholds may be interpreted 
in terms of the dimension of the space spanned by the internal momenta k; of the reduced diagram, 
at the solution of the Landau equations. For a normal threshold, these momenta form a one- 
dimensional space. This is obvious from the set of equations (6-102) for the reduced diagram depicted 
in Fig. 6-30. These equations express that all the momenta k; (i = 1,...,!) are collinear, and hence 
collinear with the external momentum P = Dawe): On the contrary, the dimension is larger than 
one for an anomalous threshold. 


6-3-3 Real Singularities of Simple Diagrams 


The general considerations will be illustrated on one-loop diagrams. First we consider the bubble 
diagram of Fig. 6-30. We are interested in analyticity properties in the variable s = P?, where P is 
the total energy momentum entering one or the other vertex. The diagram may represent a self- 
energy contribution in a g® theory, or a scattering amplitude in a o* theory, etc. At any rate, the 
solution of the Landau equations is trivial. We write the integral in momentum space as 


T¢(P?) = | Soa (6-111) 
(2 any (k? — mj) [(P — k)? — m3} 


Here instead of I¢(P) defined in (6-83) and (6-101) we consider the quantity T,(P) = (—i)”*! I¢(P) that 
contributes additively to the scattering amplitude of Eq. (5-171). The Landau equations express the 
existence of two real numbers 4, and 42 such that 
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Ay (k? —m*)=0 
A2[(P — k)? — m3] = 0 (6-112) 
Aiky — A2(P bean k), | 0 


, This means that P and k are collinear, and it follows that m?1? = m3A3, s = P? = (m, +m )?. The 
value (m, + m2)? is the expected normal threshold, while the other one, (m, - m)?, will soon be 
shown not to occur on the physical sheet. It seems that we are cheating, since the integral (6-111) 
is logarithmically ultraviolet divergent and requires a renormalization. Only the value of Tg(P?) 
subtracted at some point is meaningful, but this subtraction does not modify either the Landau 
equations or the singularity structure. In parametric space, the subtracted integral reads 


1 |, da, daz 5(1 — a, — a2) Ot Ot2S/(, + 2) — amt — ar] 


Tg(s) — Te(s1) a (4x)? 0 (0 + a2)? 


0104281 /(y + 2) — ami — am} 


as is readily seen from Eq. (6-90) if the integration over J is carried out after subtraction of the 
integrand at an arbitrary value s,. The singularities come from possible zeros of S = a, 0250, + 
G2) — &mi — a2 m3 and the equations 0.7/da, = 0(i = 1, 2) lead to 


7 D4 

aS aS: 
mi = ——> m3 = ——— 
(a, ta 2) (a; ar a2) 


Hence s = (m, + m2)” since a, a2 > 0. The integral may be computed explicitly: 


Anz ms + m3 — s+ A1/2 m? — m3 
4n7[ Te(s) — Tels.) = | —] : s ae = 
[ Te(s) a(s1)] | 2s n (“ qamk — 5 — aie 95 


m2 
In = - [s=s,] (6-113) 


In this expression the last term reminds us that the amplitude is subtracted at s = s,, and A stands 
for the function introduced in (5-155a): 


A = A(s, mj, mz) = s? + mi + m} — 2(mi + m})s — 2m3m3 = [s ~— (m, + m2)*][s 4 (m, — m2)? | 


The argument of the logarithm can be rewritten as 


mi tmi—s+ atl Sm: + m2)? —s + /(m; — m2)? — 5 


2 2 ae i 7 | ss 
mi + m3 —s — dl (m, + m2)? — s— (m, — m2)? —s 


and in this form it is clear that in the physical sheet, corresponding to the first determination of 
the logarithm, Tg(s) is free from singularity at s =(m, — m2)?. On the other hand, singularities at 
s = 0 and s = (m, — m2)? occur in the unphysical sheets. 

Even though this is not the main purpose of this subsection, we may elaborate a little on these 
complex singularities. The singularity at s =(m, — m2)’, also called a pseudothreshold, is seen to 
correspond to a solution of the Landau equations in parametric space at a negative value of a 
or &2, thus outside the initial integration interval. This means that going to unphysical sheets has 
forced us to deform the contour of integration from 0 < « < 1 to a complex path. This 1s the analog 
of the phenomenon discussed in the example (6-99b). Similarly, the singularity at s = 0 is the analog of 
the case (6-99c). This singularity, which did not appear as a solution of the first set of Landau 
equations (6-112) is an illustration of the so-called second-type singularities that we have discarded 
by assuming that the boundaries at infinity of the integration domain in momentum space were 
irrelevant. They are not, actually, because a pinching at infinity may occur which leads to the 
singularity at s = 0, as seen in the form (6-111) of the integral. The Landau equations express that 
the hyperboloids k? = m? and (P — k)? = m} are tangent, which occurs when P? = (m; + mz)’; but the 
contact may also take place at infinity, when the centers coincide or are separated by a zero length 
vector, that is, s = P? = 0. 

The appearance of a singularity at s = 0 must be a warning. The analysis of complex singularities 
is indeed .. . complex! 
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Py 
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Figure 6-32 The triangle vertex diagram, and its dual diagram. 


In the physical sheet, the imaginary discontinuity of Tg(s) across the cut running from (m, + m)? 
to +00 is 


_ A+!?(s, m3, m3) 


Tg(s + is) — Tg(s — is) = 2iATg(s) = 2i fi O[s — (my + m2)?]} (6-114) 


From this absorptive part, the amplitude T¢(s) — Te(s,) may be reconstructed by a once-subtracted 
dispersion relation, i.e., by the Cauchy formula along a contour encircling the cut: 


S80 fie ds’ Im Tg(s’ 
Te(s) — Te(s,) = —*# } eee IE (6-115) 
(m,+m;)* (s a S;)(s _ s) 
We now turn to the vertex diagram of Fig. 6-32a: 
d*q 1 
ee --i( ———— 6-11 
olPts Pas P= —! | Oe OF = mid — ma — m8) a 


with k, = q — po, kz = q + px, and k3 = q. The Landau equations consist of the system 
A{k? — m?) = 0 


i= 1,2,3 
Aiki = 0 


For the leading singularity, none of the 4 vanish; therefore the determinant of scalar products (k;- kj) 
must vanish. This is more conveniently rewritten as 


1 yi2 yi3 
yi2, 1 ya3 | =O (6-117) 
Vise vase 
where yj; = kj+kj/mjm; = —(pe — m? — mj)/2mim,, (i, j, k) being a permutation of (1, 2, 3). 
Similarly, for the nonleading singularity occurring at 43 = 0, say, we must have 
yy 
jd = 0 
Yi2 


that is, 
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We recover the normal threshold at s = p3 = (m, + m,)? while, as in the case of the bubble 
diagram, the singularity at p3 =(m, — m2)? does not show up in the physical sheet. This may be 
seen by the analysis of the Landau equations in parametric space. The discontinuity across the cut 
between (m; + m2)? and +00 is (with s = p3) 


2i a+b 
Te(s + ie) — Tg(s — ie) = ————-__ In ——_- z 
G(s + ie) — Te(s — ie) Gimme Ge (6-118) 
where a= s* — s(pj + p3 + mj} + m3 — 2m3) — (mi — m3)(p? — p3) 
and b = A'/?(mj, m}, s)A*/? (pi, p3, s) 


Of course, similar normal thresholds exist in the channels pj and p3. 

To analyze the anomalous threshold, we introduce the dual diagram of Fig. 6-32b. It is a tetra- 
hedron in three-dimensional euclidean space with edges of square lengths p? and k?. This is possible 
since p? > 0 and the k; are on-shell, k? = m?. The stability conditions of internal and external 
particles insure that the angles 6; (0 < 0; < x) defined through 

6, - — amin 
2m,m2 
are real. Equation (6-117) implies that the dual diagram lies in a plane. Finally, the reality condition 
requires the central point of Fig. 6-32b to lie inside the triangle. This means 6; + 62 > mor, equivalently, 
cos 6, + cos 6, < 0, that is, 


my (pt — m3 — m3) + m2(p3 — mi — m3) > 0 (6-119) 


Let us derive this result using the parametric representation, in the special case where pj = p3 = pt, 


? = m3 =m’. The function ¥ reads 
2 
__ 12 p3 + a2aspi + 30 p2 


, + Az + 43 


SF 


— am? — 02m} — a3m3 


and the Landau equations, after elimination of «, = 1 — x2 — 43, yield, for x = %2 = %3 (O<x<$4 
since 0 < a; < 1), 
a?(4p? — p3) = m3 
(4p? — p3)(a? — a) = m? — p? 

The solution « lies between 0 and $ provided 

. p?>m+m3 (6-120) 
This expresses condition (6-119) in that special case. The anomalous threshold takes place at 

1 


1 
a ae 
3 


3 


p= — 

m 

If we consider a physical instance such as the electromagnetic form factor of the nucleon, it 

seems that we are in trouble since an anomalous threshold might occur below the normal one. 

Indeed, the lowest state coupled to the photon is the two-pion system, m= m, = m2 2 ™m,, and the 

lowest one coupled to the on-shell nucleon (p? = pj = pi = Ms) is the pion-nucleon state: hence 

m + m3 =m, + My. If we take m3 = m = (m, + My)/2, all the previous conditions are fulfilled; Eq. 
(6-120) reduces to 

Mi >4(My + m,)?  -that is, (\/2 — 1)My = ms 


and is obviously satisfied by the experimental masses. Fortunately, this reasoning has not taken into 
account the various conservation laws, e.g., the conservation of baryonic charge which tells us that 
any state coupled to the nucleon must contain at least one baryon, of mass larger than My. Thus the 
condition (6-120) cannot be fulfilled and we are safe. Similarly, anomalous thresholds are not present 
in the form factors of pions or kaons, but they appear for hyperons or deuterons. 
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Py | P3 
my 
s ui my, M4 
my 
P2 Pa 
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Figure 6-33 The box diagrams in the s — t channel (a) and in the crossed u — t channel (5). 


The analyticity properties of the scattering amplitude depicted in Fig. 6-33a, the box diagram, 
may be studied along the same lines: 


a if atk (6-121) 
oo") Qa [e+ — m3) [D1 — ps + K? — mea] [pa — ky? — me? — m3) 


In the s channel, we find a normal threshold at s = (m, + m2)? [in the t channel at ¢ = (m3 + ma)? ]. 
The discontinuity across the corresponding cut may be evaluated: 


Te(s + ie, t) — Tg(s — ie, t) = 2iA, Te(s, t) 


5; 284tl2(6, mi, ma), [(ac — bd cos 6) + Dt? 
- 4nD1/2 (ac — bd cos 6) — D*? 


ots — (m, + m2)*] 


(6-122) 
where @ stands for the center of mass scattering angle 
s +, (ee 
AN?(s, pi, p)A"/?(s, p3, pa) s 


The expressions of a and b have been given in (6-118), and c and d are obtained from a and b by 
changing p, into ps, p2 into p4, m3 into mg. Finally, 


D = (ac — bd cos 6)* — (a* — b?)(c? — d?) 


cos 8 = 


i j } j ¢ a 2 2 2 Z 2 
Thgse expugegions simplify in the case of equal masses mj = m3 = m*, m3 = m3 =m’, pi = p3 = 
P3 = pa = p” to 
2sA1/2(s, m2, m3) 


pis = {st [ls — 4m?)(t — 4m) — 4(p? — m? — mi27]} 1 (6-123) 


The discontinuity A, 7¢ considered as a function of t has itself a discontinuity in the t channel, called 
the double spectral function p,,: 


1 
PsilS, t) a a [A; Te(s, t+ ie) =A; To(s, a ie)] 


2sA1/(s, mi, m3) 


Therefore, To(s, t) satisfies fixed s or fixed t dispersion relations, or even an analytic representation in 
the set of variables s and t: 


PERTURBATION THEORY 313 


hire Ee 
fesetient |” sagidaltn 
( 


T Jim y+m2)2 s'-—s 


1 Pst(S’, t') 
=— as dt <= SS 
e {| se aa 


The second expression in (6-125a) involving the double discontinuity ps, isa special case ofa Mandelstam 
representation. The box diagram considered here has vanishing double spectral functions p,, or Py. 
This is not the case of the crossed diagram of Fig. 6-336, for which p,, only is nonvanishing, and 
which therefore satisfies the fixed s dispersion relation 


(6-125a) 


ee A. Te(t’, uv’ = 2 i = co) aa pe oer, 
Tatu == | an g(t’, u DP t 242f ay betel > p? —s—u',u’) 
T Jimyt+ma)? | i 4 T J (y+ m2)? u’—u 
(6-125b) 
or the Mandelstam representation 
1 Prt’, u’) 
Te(t, u) = — | | dt’ du’ ———____ 6-125 

On ete | | = 0 Cee 


All this is not completely correct. In the foregoing, we have assumed that all external particles are 
stable against decay: pj <(m, + m3)*, etc. and that no anomalous threshold occurs. The precise 
conditions for this may be studied in the same way as in the vertex case. 

It has been conjectured that, more generally, in favorable cases such as pion-nucleon scattering, 
the physical scattering amplitude satisfies a general Mandelstam representation of the form 


(s — s1) i prl(s’) re 


a? (s’ — s)(s' — 83) 


Te(s, t, u) = Te(s1, fi, uy) et 


_o* \, Bai ao, “) [* palw') a 
= ame — t)(r =ty) m Jaz (u’ — u)(u' — uy) 
(s a Si)(u hae uy) {| , , PsulS's u’) 
i nt ae (s’ — s)(s' — s)(u' — u)(u’ — m1) 


, ite tu — uy) {ee a Pult’, u’) 
rect) 


(t’ — t))(u' — u)(u’ — uy) 


“s 
Sesh — ti) pe Pse(s’, t’) 6-126 
er | | - jee = eae ee 


a 


where the double spectral functions p,,, p,,, and p,; are nonvanishing in definite regions (the shaded 
areas of Fig. 6-34). In Eq. (6-126), we have assumed that subtractions were necessary. 

Beyond its illustrative purpose, the study of these simple diagrams may be useful as a preliminary 
step before tackling more complicated cases. For instance, through the use of a majorization technique, 
analyticity properties of higher-order diagrams may be deduced from those of simpler “primitive 
diagrams.” For this subject, we refer to more specialized references. 


6-3-4 Physical-Region Singularities. Cutkosky Rules 


Physical-region singularities correspond to real external four-momenta. Such is 
the case of normal thresholds, while anomalous thresholds do not appear as a 
leading singularity in the physical region if the stability condition of external 
particles holds, i.e., if for each vertex the square of the incident four-momentum 
P? is smaller than the lowest normal threshold of that channel. 
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Figure 6-34 The stu plane for pion-nucleon scattering. The areas shaded by diagonal lines are the 
physical regions of the s channel tN > nN, u channel nN > xN, and t channel xn > NN; mand M 
stand for the pion and nucleon masses respectively. The double spectral regions have been shaded by 
dots. 


Coleman and Norton have found the following simple interpretation. The 
leading singularity of a diagram G occurs in the physical region if and only if the 
vertices of G may be regarded as real space-time points and their internal lines as 
the trajectories of real on-shell relativistic particles. 

To prove this statement, it may be more convenient to use the mixed repre- 
sentation of Eq. (6-104). For physical-region singularities, the integration variables 
a; and g, must take real values at the solution of the Landau equations. The 
latter read [compare with Eqs. (6-102), (6-103), and (6-106) | 


k? = m? (6-127a) 
Yd (t)ouk: = 0 (6-127b) 


ieL, 


The reality of the external momenta P and of the integration variables g implies 
the reality of the k. These four-momenta are on their mass shell, owing to (6-127a). 
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Figure 6-35 Illustration of the Cutkosky rule: the cut 
represented by a broken line contributes to the 
singularity at s = (m, + m2)*. The momenta k, and kz 
are on their mass shell: kt = mi, k3 = m3. 


As for Eq. (6-127b), it means that if particle i propagates with momentum (+k;,) 
during the proper time interval «;m; (a; > 0), the total space-time displacement 
along a closed loop is zero. In other words, the diagram may be considered as a 
genuine physical process. 

The Cutkosky rules give a compact expression for the discontinuity across 
the cut arising from a physical-region singularity. Let I¢(P) be the Feynman 
amplitude of Eq.(6-101) and Tg(P) = (—i)’ * !Ig(P) the corresponding contribution 
to the scattering amplitude. The discontinuity 2iAT¢(P) in a given channel asso- 
ciated with a specific cut is obtained as follows. Let (J — I, — I,) internal momenta 
be on-shell, kj = mj. and I; + I, momenta off-shell. These may be subdivided 
into two groups pertaining to two subdiagrams G, and Gz, each with J; internal 
lines and V; vertices (Fig. 6-35). The following formula holds: 


d*q rH i 
nyt AA, kB = mi + ie 


1 L 
ATe({P) = 5 (ae i {1 


a T]  (2n)0(4e9)5(k? — m3) (6-128) 


7) . 
2 — Mi, — 16 jzty+,+1 


The role of the 6 functions is to put the particles corresponding to the inter- 
mediate state on their positive energy mass shell. In the physical region, Eq. 
(6-128) follows from the fact that individual diagrams satisfy the unitarity condition. 


The following elegant proof is borrowed from Nakanishi. We start from an arbitrary diagram, assumed 
for simplicity to arise in a scalar theory. We observe that we may always construct a hermitian 
lagrangian such that the lowest-order scattering amplitude for a definite process is the diagram under 
consideration. To this end, we attach to each internal line | of the diagram a different species of 
field, g), of mass m, (the mass of the corresponding propagator). Moreover, if an external line (or 
set of lines) of momentum P, enters the vertex v, we attach to this vertex a field ¢,, of squared mass 
M2 = P?. Provided P? > 0, the initial diagram may now be regarded as the lowest-order contribution 
to the on-shell scattering amplitude of the particles described by $1, $2,---, derived from the lagrangian 


I o1) (6-129) 


te 0 


I 


LY, = Y 4$[(00)? — m2 g?] + ¥ 4164.)? — Mido] + > 6.( 


1=1 


In the last sum, ¢, must be replaced by one if no external line is incident to v, and the product 
runs over all internal lines incident to v. This construction is illustrated in Fig. 6-36. If i denotes the 


316 QUANTUM FIELD THEORY 


(a) (b) 


Figure 6-36 The diagram (a) has the same analyticity properties as the diagram (b), which may be 
regarded as the lowest-order contribution to the on-shell amplitude a+6-—c +d in the theory 
described by 


5 


Y= Y $00)? — meg?] + YX 2[(2b.)? — MiG2] + bugig: + Op193* PeOrOaPs + baP3PaPs 
1=1 a,b,c,d 


initial state (characterized by P° > 0) and f the final one, it is straightforward to check that 

Tj: = TelP) 
where 7;; is computed from Y,, while Tg is simply the amplitude associated with the initial 
Feynman diagram G. Since unitarity holds for on-shell processes [Eq. (5-154)], we have 


T 53 = T =1 as (22)*5*(P, — Jedd = 2iATg(P) (6-130) 


In the language of Feynman diagrams, the sum runs over all possible intermediate physical states n, 
that is, such that the intermediate momenta are on their mass shell 
k? = m? kP>0 les 
Unitarity thus yields (6-131) 
2ATe(P) = ¥ Te,(P)TG,(P) 
where the sum runs over all partitions of G into two subparts G,, G2: G; (respectively G2) is con- 


nected to the initial state i (respectively final f) and G is the union G; UG, U | ¥}. It also includes 
an integral over the intermediate-state phase space. We thus recover the Cutkosky rules. 


The reader will verify this rule in the case of one-loop diagrams. 

We may also try to extend it beyond the physical region, thereby generalizing 
unitarity. This is, for instance, what is needed, if we want to compute the double 
spectral functions defined in the last subsection. However, proving these general 
Cutkosky rules requires nontrivial methods. In particular, much care is necessary, 
since the meaning of the 6'*? distribution for complex k becomes hazardous. 


NOTES 


Covariant perturbation theory, born in the mid-1940s, is due to S. Tomonoga, 
Prog. Theor. Phys., vol. I, p. 27, 1946; J. Schwinger, Phys. Rev., vol. 74, p. 1439, 
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1948, vol. 75, p. 651, 1949, vol. 76, p. 790, 1949: R. P. Feynman, Phys. Rev., 
vol. 76, p. 769, 1949; and F. J. Dyson, Phys. Rev., vol. 75, pp. 486 and 1736, 
1949. The definition of a covariant propagator was anticipated in the work of 
E. C. G. Stueckelberg; see his paper with D. Rivier, Phys. Rev., vol. 74, p. 218, 
1948. 

Scalar electrodynamics was considered by P. T. Matthews, Phys. Rev., vol. 80, 
p. 292, 1950. The method used in the text is from F. Rohrlich, Phys. Rev., vol. 80, 
p. 666, 1950. 

The early calculations on electron-electron and electron-positron scattering 
are recorded in the book by N. F. Mott and H. S. W. Massey, “Theory of 
Atomic Collisions,” Oxford, 1956; see especially chap. XXII. Polarization effects 
are discussed in W. H. McMaster, Rev. Mod. Phys., vol. 33, p. 8, 1961. 

The Legendre transformation was introduced in field theory by G. Jona- 
Lasinio, Nuovo Cimento, vol. 34, p. 1790, 1964. For a general survey of diagram- 
matics see G. ‘t Hooft and M. Veltman, “Diagrammar,” Cern report 73-9, Geneva, 
1973. 

Topological properties and parametric representations of Feynman diagrams 
are studied in the book by N. Nakanishi, “Graph Theory and Feynman Integrals,” 
Gordon and Breach, New York, 1970, which also contains an introduction to the 
analyticity properties. 

Among the many contributors to this study let us quote L. D. Landau, Nucl. 
Phys., vol. 13, p. 181, 1959; S. Mandelstam, Phys. Rev., vol. 112, p. 1344, 1958, 
vol. 115, p. 1741, 1959; and R. E. Cutkosky, J. Math. Phys., vol. 1, p. 429, 1960. 
An interpretation of physical-region singularities is found in S. Coleman and 
R. E. Norton, Nuovo Cimento, vol. 38, p. 438, 1965. 

More details and references can be found in “The Analytic S-Matrix” by 
R. J. Eden, P. V. Landshoff, D. I. Olive, and J. C. Polkinghorne, Cambridge 
University Press, 1966; “Analytic Properties of Feynman Diagrams in Quantum 
Field Theory” by I. T. Todorov, Pergamon Press, Oxford, 1971; and in the text- 
book of J. D. Bjorken and S. D. Drell, “Relativistic Quantum Fields,” McGraw- 
Hill, New York, 1965S. 


CHAPTER 


SEVEN 
RADIATIVE CORRECTIONS 


The renormalization program of quantum field theory is presented and carried 
out to the order of one-loop diagrams in electrodynamics. It is then applied to 
the calculation of the magnetic moment anomaly, radiative corrections to Coulomb 
scattering (involving an analysis of infrared divergences), the atomic Lamb shift, 
and photon-photon scattering. We also include a discussion of induced electro- 
magnetic long-range forces between neutral particles in the relativistic regime. 


7-1 ONE-LOOP RENORMALIZATION 


We undertake in this chapter the study of higher orders of perturbation theory. 
What appears, at first sight, as a straightforward exercise, requiring perhaps 
analytical skills, turns out to be a highly nontrivial problem due to the presence 
of ultraviolet divergences. The general presentation of the renormalization theory 
is postponed to a later chapter. In order to get some familiarity with the subject 
we concentrate here on the computation of radiative corrections to lowest order 
in quantum electrodynamics. This enables us to see how we extract sensible 
results from apparently ill-defined expressions, to compare them with experimental 
values, and to progressively introduce the concepts of renormalization. A serious 
drawback of this approach is that electrodynamics is in that respect a rather 
involved theory. We have to cope with gauge invariance and to disentangle 
infrared from ultraviolet divergences. Nevertheless, its amazing successes certainly 
make this effort worth while and justify inverting the logical order. 
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The parameters such as masses and coupling constants which appear in the 
lagrangian are not directly measurable quantities. In the classical point-particle 
theory, for instance, we must add to the bare mass an electromagnetic contribution 
to obtain the physical inertial mass. The latter is, of course, finite while the former 
may well be infinite. We shall therefore give an operational definition to the 
fundamental parameters (finite in number). Renormalization theory will then show 
that the perturbative expressions for Green functions are finite when expressed 
in terms of these physical parameters. Masses will generally be defined as isolated 
poles of two-point functions. The corresponding residues, which appear as multi- 
plicative constants of scattering amplitudes, will be absorbed into the definition 
of renormalized fields. Finally, coupling constants will be chosen by fixing the 
value of certain amplitudes at appropriate points in momentum space. 

In order to carry out this program it 1s better to deal first with well-defined 
finite quantities. The origin of divergences lies in the singular character of Green 
functions at short relative distances. Equivalently, in momentum space the Fourier 
transforms do not vanish fast enough at infinity. In an intermediate step we 
are then led to regularize the theory, i.e., to replace the original expressions by 
smoother ones such that the integrals become finite. We shall thus proceed in 
three steps: (1) regularize, (2) renormalize, and (3) eliminate the regularizing 
parameters. Renormalization will be successful if finite quantities are obtained 
as a result of this process. 


7-1-1 Vacuum Polarization 


Let us first take a look at the photon propagator in momentum space. To the 
free contribution 


(Gov—Kpkv/u? . Kk pky/u? ) yw 
i — jee ee M? =— 7-1 
Gonlh) (Se 5 et A ce 
we should add a correction which, according to the rules of Chap. 6, is given to 
lowest order by (see Fig. 7-1) 
(Ue) = GLU RG PY (kyGi) 
Gov (k) ee Gop (k)@ (k)G, v(k) (7-2) 


rey ey d*p we i : i ) 
ge (2x)* Vp—m+ic’ p—k—-m+ic 
The additional minus sign arises from the fermion loop. The integral seems 


quadratically divergent for large internal momentum p. To give it a meaning we 
use the Pauli-Villars regularization. This amounts to minimally coupling the 


Dac Figure 7-1 Photon propagator to lowest order. 
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photons to additional spinor fields with a very large mass Asm. These fields 
might correspond to indefinite metric sectors of the Hilbert space. As far as the 
vacuum polarization tensor @(k) is concerned, this prescription implies the 
replacement 


BS 
o'(k, m) > @?"(k, m) + ys C,@°"(k, Asm) (7-3) 
s=1 


where the substitution is understood under the integral sign of Eq. (7-2). Were it 
not for the divergence of the integral we would recover the original value in 
the limit J, > oo. The constants C, will be chosen in order to remove this divergence. 
The minimal coupling of the additional fields implies that gauge invariance 1s 
preserved in this regularization procedure. 

Denote collectively by the symbol A the large masses A,m and let ’"(k, m, A) 
stand for the right-hand side of (7-3). We have 


: d*p tr y°(p + m)y"(p — K + m) 
y ese Sr rrr 
aw? (k, mM, A) mor e {= io = m2 A ig) [(p — kj? aa m? ar ig | 


s 
++ y C,(m > ian} 
s=1 


age? ( 4°0. [ptle— + ip = 5)? ola 
= (27)* (p? — m2 + ie) [(p = k/ — m? + ie] 


Ss 
+ ¥ Cm i (7-4) 


The function w*"(k) is the Fourier transform of the current-current Green function. 
To any order, current conservation leads to the condition 


k ,@°"(k) = 0 (7-5) 


which is formally satisfied, since 


d*p 1 1 
2 ele ee pede eee ey, 
: |e (Ky al 


= we | FF jo) 
(2)* p-—Kk-—m+ie p—m+ie 


where we have replaced k by (p — m + ie) — (p — kK — m + ie) and used the cyclic 
invariance of the trace. If the integral were convergent a translation of integration 
variable p — p + k in the first term would insure that condition (7-5) is fulfilled. 
While this manipulation is meaningless for the original expression, the same steps 
are justified for the regularized tensor, showing that we have a sensible regulariza- 
tion. 

To evaluate (7-4) we use the parametric representation introduced in the 
previous chapter with the additional complication arising from the numerators 


k,w?"(k) 
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in the fermion propagators. We therefore write 


: dplfeaoe «ea aro 
p(k, m, A) = —4e? = LS te 2 
ovl ) 7 |e i beky hy) on ae 9” = jm )| 


x is day | doa exp (i{ay(p? — m?) + a2[(p — k)? — m?] 


Ss 


+ 21°p + 22°(p—k)}) + pac Cima) 


Z,=2z2=0 


Auxiliary four-vectors z; and 2, have been introduced to generate the integrand 


in (7-4) by differentiation. Integrating over p and performing the required deriva- 
tives we obtain 


E ia |° |? da, da, 20102 OL Xk? 
@® p(k, m, A) = — ee Pe = meee. 
onl ™ \ ( (x; + o)° (2 + 2)? ae EE: (a + a)? 
+m? |> exp <i| —m?(a Bega) i 2 
oa ae 04a) : : Hy + 2 


5S 
+ 3 C.(m > iam) 


The polynomial in k appearing in the integral can be rearranged to read 


P Xy X2 1 Xo l 
a | o—= | HO 
a _—" (a +o) 9° E (a1 + a2)? (% + = 


Our calculation has remained covariant but @,, does not explicitly exhibit current 
conservation, which requires it to be proportional to the combination k,k, — 
gpvk~. The second term in the above expression does not have this structure and 
its contribution reads g,,A@, with 


ie id fo lS cand = 04 2k? i 
nol (*f ten of st] 
T Jo Jo (a1 +2)” s=0 (a1 + a2) 01 -F G2 


04 ot 
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where by convention Co = 1, m§ = m*, m2 = A2?m?. This can also be written 


ea -2| \ siting, é L y Gs exp {ip| —mblay + a) 
0 


o (a + ay)? P dp Pp s= 
20] 
Oy + H2 


For fixed m, the constants C, are chosen in such a way that the integrals 
converge in the vicinity of w,,«; > 0. The convergence for large « is insured by 
the ie prescription implicit in the masses. We may then interchange derivatives 


jo 
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on p with integration. The expression 


i hi da, daz |-n Ly 2 | 
———- 9. C, Mi Og) k 
I | (a; + o2)°p % ae {i ola : wt te 


is then found to be p independent as shown by a change of variable «; > p— ty. 
Therefore A@ vanishes. The remaining term of the vacuum polarization tensor 
reads 


Dov = — i(g pyk? ae k ky) ss 
2 Zon) ie 1 O2 

2 = pot a 
@(k*, m, A) = = | \, da, daz et a,)* (7-6) 


< yc, oe m2 (x + 02) + i? 
s=0 : ; 1 + Q2 


Using the homogeneity properties we introduce a factor 1 = J% dp 6(p — a — 42) 
under the integral sign and change the variables according to x; > pa;. We obtain 


1 1 foe) KY 
ET | \ day daz 0(1 — %, — %2)a1%2 \ 2 ‘ioe 
Tk Jo Jo 0 P s=0 
The integral over p looks logarithmically divergent at the point p = 0. Having 
factorized two powers of momentum in passing from @°” to @ we have decreased 
the degree of divergence from two to zero. This is an example of the close 
relationship between the degree of divergence and the dimensionality of integrals. 
If we choose the coefficients C, in such a way that 


RY Ss 
y Catt > C=O (7-7) 
s=0 s=1 


the regularized integral over p will be convergent. Furthermore, let us pick k 
such that k? < 4m’, which corresponds to the threshold of pair creation. Since 
1, & are between zero and one and satisfy a, + a = 1, it follows that «,a2 < 4. 
Then (m? — «1%2k?) is positive and the integration contour in the complex p 
plane can be rotated by — 7/2 in such a way that the integral reads 


up 
lim ade y C, e — p(m? — a %2k?) 
n0 Jy 


P s=0 
= lim y ce (-< Im P| nd ” wt | dp inp 
102 0 


120. 


Condition (7-7) eliminates the dangerous In 7 term in the integrated expression. 
For fixed k? < 4m? and m2 = 12m? — co the result takes the form 


= in (m? — ayo2k*)+ VC, inne |= a in (1 - quo )+3 oy Caln 2| 
s=1 m? 


where we have neglected k* as compared to m2. Let us define A such that 
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——-_$_ eee Figure 7-2 The complex k? plane for the vacuum polariza- 
tion. The arrow indicates how the physical region is 


0 4m? reached above threshold. 
co A2 
y C, In 422 = —In os (7-8) 
s => 


We end up with the following expression for the regularized vacuum polarization: 
= 2a (3 A2 ke? 
@(k?, m, A) = -2| dp B(1 py) —In Sin — Bil = at 
a oy m m 


The remaining integral can readily be performed, leading to the analytic expression 


sins WS ee 2m? \| (4m? a 
co(k*, m, A) = £| in +5 +2(t+ Relive | 


2 1/2 
x arccot (* = 1) = i} (7-9) 


The calculation has been carried out under the assumption k? < 4m?. The corresponding function 
may be continued in the complex k? plane. The values for k? > 4m? are obtained by taking a limiting 
value from above the cut, starting at the point k? = 4m? (Fig. 7-2). The discontinuity across the 
cut is @(k? + ie) — @(k? — ie) = 2i Im @(k? + ie). This can also be derived using the last integral 
representation given above by setting 8 = (1 — u)/2, integrating by parts, and changing the variable to 
u = (1 — 4m?/k’?)!/?. We find that 


Ce) t 2 
ole Ce ae dk’? 1 4m? \1/2 2m ) 
A)=—In->+—k — —=—, || -— 1+ —- 7-10 
ae ie a kK? ke? ee 
a once-subtracted dispersion relation exhibiting the above analytic properties, with 
4 2\ 1/2 2 2 
i Im &(k2, m) = : ( ae =) (: Ee = ) (7-11) 


This absorptive part is independent of the regularizing cutoff A and hence cannot be affected by 
renormalization. It coincides with our earlier calculation of pair production by an external field, 
Eq. (4-105), expressed as the square of the corresponding amplitude (Fig. 7-3) and given to this 
order by 


W) = Re | d*k a,(k)@"(k)a,(—k) = [ett Im &(k)[|E(0|? — |B()|?] 


The only effect of regularization has been to provide the constant («/37) In (A7/m”), which is of course 
divergent if we let A go to infinity. 


> Figure 7-3 The probability for pair production in an external field 
giving the discontinuity of the vacuum polarization tensor. 
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Figure 7-4 Photon propagator in terms of the vacuum polarization. 


Before applying renormalization, let us return to the original expressions 
(7-1) and (7-2). We do not really intend to compute the photon propagator like 
that! The contribution G''! added to G'°! would produce a double pole besides 
the simple one. When we were discussing the ordering of perturbation theory 
according to the number of loops we had in mind the one-particle irreducible 
Green functions, the inverse propagator for the case at hand. The zero-loop 
contribution to this quantity is G9}! and its one-loop contribution is —al}), 
where we have added the suffix 1 to indicate that it was evaluated to order one: 


GEM — BE) = [2 — p2)g py — (1 — Adkeoks + Gouk? — kpky@""] (7-12) 


Inverting this expression we obtain the propagator represented diagrammatically 
by a sum, each term of which is a string of bubbles (Fig. 7-4). If more generally 
we succeed in showing to all orders that the vacuum polarization tensor has the 
form of a scalar function multiplying the combination (g,,k? — k,k,), then Eq. 
(7-12) with the index one omitted will give the general relation between the 
propagator and @ = @"! + G14 --- as 


6, Ger + BU Megke/u? _ kpky I 
oo e+ a= PR 


(7-13) 


Equation (7-13) can be compared with formula (5-80). At the time we were ignoring mass renormaliza- 
tion as well as divergences. To be more precise, assume that the denominator in (7-13) admits an 
integral representation of the form 
1 co k2 
- | ay? OE) (7-14) 
0) 


01 + a(k)) — 2 k? — kK? 


os @ continues to satisfy a dispersion relation of the type (7-10) (with a threshold at the origin as 
py? -+ 0, instead of 4m? as:it is to lowest order), we shall have the relation 


k? Im @(k?) 


Oe (ET Re] — + Im 


+ pole contributions 


If the integrals that we write remain meaningful we find 


1+ @(k?) -71 p? 
k?[1 + @(k?)] — py? ik? up k?[1 + @(k?)] — - 


= La we |; dk'? p(k’?) 
2 a = 


2 k’?) 1 ale? 
= py? dk’? et. Be ey 12 Z 
ih |, k’2(k? me k'?) oe k? 1 Lt A ki? p(k ) (7 15) 


According to (7-14) taken for k = 0 and assuming @(0) finite, the last term in brackets in the above 
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formula vanishes. Consequently, Eq. (7-13) reads 


% v— Kokyfk'? — kok 1 
Gena diet pies) Lemos Keke : 
ork) | Megat ie on (7-16) 


This is indeed comparable to Eq. (5-80). 


The resummation implied by Eqs. (7-12) and (7-13) has generated a new 
phenomenon. Indeed, the isolated pole at k? = y” of the free theory has at best 
been moved, and corresponds now to the zero of the expressions k?[1 + @(k?)] — 
uw. As far as the photon propagator is concerned this is not too drastic. Indeed, 
uw? has only been introduced for convenience, in order to cut off the infrared 
divergences in intermediate calculations. We shall always have to compute com- 
binations such as jf(k)G,.(k)j3(—k) where j,; and j, are conserved currents 
[k- j(k) = 0] in order to extract physical information. This combination eliminates 
the terms in k,k, and enables us to consider the limit yz? — 0. If instead we would 
have set * = 0 then G would have taken the form 


do, keky 1+. G(7)—-2 
Pil+o) kk ai+ak) © 


iG p(k) = =0 (7-17) 


Unless (0) = co, the quantity k?[1 + @(k?)] still vanishes at k? = 0. 

In a loose way we may characterize the property k?[1 + @(k)]|,2=0 =0 
by saying that there is no genuine mass renormalization for photons. But wave 
function renormalization is there, since the residue of the pole at k* = 0 is now 
equal to [1 + @(0)]~’, instead of being one. One-loop calculations reveal a 
remarkable circumstance. Namely, under regularization we find that the only 
potentially divergent and therefore unknown term in @ is a constant which may 
be taken as its value at k? = 0. If only j- G-j is measurable we may use two widely 
separated identical sources to define what we mean by the charge squared. We 
define it to be the coefficient of the 1/4xr Coulomb static potential at large 
distances. This appears here as e* = e3/[1 + w(0)] = Z3e6 where we have intro- 
duced an index zero to characterize the bare coupling constant used up to now 
in the perturbative series. The above definition of Z3 agrees with the one given in 
Chap. 5 in the limit u? = 0. We conclude therefore that 


7) eas Zz 2 
. - a (7-18) 
33> = = na In a} Bio 
1+ (0) 3x om 


To be consistent we have to express all quantities as series in a parameter 
counting the number of loops L in the bare Feynman diagrams. As we have 
seen in Sec. 6-2-1, it is natural to multiply such diagrams with a factor h’. For 
this reason if we concentrate on terms of order h! we may substitute a for a 
in the finite remainder w!'(k*) — @''1(0), since this is already of order hf as is 
& — og. This also justifies the use of « in the right-hand side of (7-18). 
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According to this definition of charge the renormalized propagator reads 


9 pv 
k7[1 + @(k?) — d(0)] 
It will have residue 1 at the photon pole k* = 0 and will be expressed in terms 
of the physical charge. 

We see the renormalization program emerging : undetermined and potentially 
divergent quantities disappear when Green functions are expressed in terms of 
physical renormalized quantities. 

When we set °C, In 42 = —In(A?/m?) the sign was chosen in agreement 
with condition (7-7). The intuitive content of (7-18) is that the bare charge has 
been screened at large distances by a factor Z3 = 1 — (x/3z) log (A*/m*) smaller 
than one and positive as long as A?/m? is not overwhelmingly large. Vacuum 
polarization corresponding to the creation of virtual electron-positron pairs has 
reduced the charge of a test particle as seen by a distant one. 

We may take another point of view in deriving equations such as (7-19). It 
amounts to saying that the original lagrangian is only a bookkeeping device 
enabling us to define the perturbative amplitudes. It was postulated from some 
intuitive correspondence principle between classical and quantum mechanics. As 
such it may be amended by quantum corrections. Let us therefore assume that 
to lowest order ¥ is expressed in terms of physical parameters. It will then be 
necessary to construct perturbative corrections of the form 6, called counter- 
terms (the coefficients of which will become infinite as the cutoff goes to infinity), 
in order to maintain the proper definition of these physical parameters: 


Lor = Le, m,...) + oF 
Of =F! oe) 4 oe 


iGR(k) = + terms in k,ky (7-19) 


(7-20) 


The various pieces of 6 are classified according to the corresponding powers of 
h counting the number of loops in the original diagrams. The fact that this 
number of loops may be expressed as powers of fi ties in nicely with the previous 
remarks. 


Our calculation may then be interpreted as identifying a contribution in 
6") of the form 


bf = —4(Z3 — 1 FF (7-21) 


Adding such a term to # will produce an additional diagram to order h (Fig. 7-5) 
and will modify the original (regularized) tensor according to 


® pv(k) 4 ® p(k) aa (Z3 za iy geek a KK) 


Figure 7-5 The two diagrams of order h for 
( ) wee = Vacuum polarization in the counterterm ap- 
proach. 
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@(k?, A) > @(k?, A) + (Z; — 1) = = In : + (Z; — 1)!" + [a(k?) — a(0)] 
3 m 


The new rule implies that we use x everywhere and not a. Also, in equation 
(7-9) the term independent of In (A?/m?) vanishes for k? = 0, justifying our intro- 
duction of a renormalized quantity @*(k*) = @(k?) — @(0). From Eq. (7-18) it 
follows that (Z3 — 1)! + (a/32) In (A2/m?) = 0 and therefore 


a er | ag 1 4m? \1/2 2m? 
@hie) = — se a =i 
=e” |r ae) |! +e 


a (1 a 2m? 4m? 1/2 4m? 12 
ea + a 2 ( ae 1 arcot ( oe 7 - if} 
(7-22) 


This expression is meaningful for real k? smaller than 4m. It has to be analytically 
continued above the cut for k? > 4m’. Note the following interesting limits: 


a k? k* 
eS +0(4) oe 


a ie Sa ee 5 


The behavior at large negative k” has a striking similarity (including the coefficient) 
with the original divergent term involving the cutoff. This is, of course, not an 
accident, and will be better understood in due time. 

The screening of charge implied by the vacuum polarization has physical 
consequences. Using the renormalized expressions and concentrating on static 
situations where k? = —k?, we see that the Coulomb law is modified by the 
replacement e? > e”/[1 + @(—k?)] very much as in ordinary dielectric materials 
where we would have e?/e(k) with ¢ as the dielectric constant. For small enough 
k? (as compared to m* which sets the scale) we can approximate the Coulomb 
interaction as 


OR(k?) = 


e? e2 e2 ; i oO k? 
— ———— Se 
k?—k?[1 + @*®(—k?*)] ~ k? 15x m? 
a result originally due to Uehling. 
In configuration space for an infinitely heavy nucleus of charge — Ze located 
at the origin this means a correction 
Ze? Wea Zee —Ze> a Ze 
= — — re 
4nr 1Sxz m 4nr Anr 15x m 
The effect of the added term has to be computed to first order, otherwise it 
would be inconsistent to keep only terms of order « in w*. Expanding around 
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zero to leading order in (k?/m?) has produced a singularity in configuration 
space. However, we only need the mean value of the potential in unperturbed 
states so that it is not crucial that the actual correction, strongly peaked for 
small r, is replaced by a 6 function. Observe that this enhancement at small r 
is in agreement with the idea that for smaller distances we recover the unshielded 
coupling to the “bare” charge. 

Applied to an hydrogen-like atom the above perturbation leads to a dis- 
placement of s-wave levels according to 


2 
SE yy = — oe | Br WES (NY, 
: 1Sam 
Zae? 4 Z*a3 
= FTEape 01:01Hx.0(0)|? al (Se on m61,0 (7-23) 


where we have used the value of the Coulomb wave function at the origin 
Wn.o(0) = (nn?.a*)~ '/?, a = (Zma)~ '. In the Dirac theory the two n = 2, j = 3 levels 
of opposite parity 2S1)2 and 2P;/2 are degenerate. The vacuum polarization correc- 
tion has the effect of lowering the s state by an amount AE-h= (1/h)(Eps, , — 
E2p,,,) = —27 MHz (Fig. 7-6). In his 1947 measurement of this level splitting, 
Lamb obtained a value of the order of +1000 MHz, proving that other effects 
largely overcompensate vacuum polarization. In Sec. 6-3-2 the full theory will be 
shown to agree with experiment, providing indirect evidence therefore for this 
screening of the charge. 


It is possible, of course, to avoid the use of the small k* approximation. A static charge located 
at the origin will induce a modified Coulomb potential to order « given by 


ae (r) = re Q(r) 
2a i Wie = hye 
= ji ars, —2mru poate 
Q(r) =F = l due (: a5 sa) a 
(7-24) 
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with y equal to Euler’s constant -fe dulnue™“ = 0.5772.... According to the definition of charge 


Q(co) = 1. We see again that as r decreases Q(r) increases, even becoming infinite as r tends to zero. 
The approximation used in Eq. (7-23) is only valid in the mean. 


2P\/2 


~—_———._ 25), 
—{s 


Figure 7-6 The vacuum polarization contribution to the 25,,. — 2P; 2 hydrogen splitting. 
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Finally, we remark that the above conventions amount to the use of a photon propagator 
with residue equal to unity at the photon pole (when py? > 0). The factor Z3;'/? attached to photon 
lines in scattering amplitudes has therefore been replaced by one. 


7-1-2 Electron Propagator 


The first nontrivial contribution to the one-particle irreducible Green function 
with two external electron lines (also called electron self-energy) is shown dia- 
grammatically on Fig. 7-7 and reads 


d*k 1 g Akt. 
— = (— ie) a 
iy'(p) ( ie) |G i F am we a oT (k? jaa we + ie)(Ak? = Te ae ig) 


i 
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This expression, a 4 x 4 matrix function of p, unfortunately suffers from all 
possible diseases. It has an apparent linear ultraviolet divergence (meaning that 
the integrand looks like f d*k/|k|> for large k). We shall see that this can be 
reduced to a logarithmic divergence. Infrared divergences might also creep in for 
small k and special values of the external momentum; we therefore keep y* 4 0 
to be on the safe side. Also gauge dependence is expected, as would be the case 
in nonrelativistic physics for a wave function of a charged particle. 

To deal with meaningful quantities it is necessary, as before, to use a regulariza- 
tion of the free propagators. To avoid cumbersome notation we shall assume 
that a cutoff A? has been introduced in an appropriate manner, without being 
more explicit. We write 


P 1-4 
Bp Se) ee) (7-26) 
in such a way that )"“(p) corresponds to the choice of Feynman gauge (A = 1), 


“ sb 
and use the parametric representation of propagators to express )\*” after 
integration over the loop momentum k as 


& |” 1 waemdes Oy ) 
*(p) = — ———, [2m — 
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x exp [ ( 7 p? — op? — aan) | (7-274) 


ay + H2 


Dp p-k P Figure 7-7 Electron self-energy to lowest order. 
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The masses are understood as having a vanishing negative imaginary part, ensuring 
convergence for the large «. 

As in the case of photons, the self-energy parts are classified according to 
the number of loops, and the propagator itself is obtained by summing the 
series (Fig. 7-8): 

i i 


ee 
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We proceed by computing )\(p, A), where the cutoff A will soon make its 
appearance when we try to integrate over a common dilatation factor of the 
o% parameters: . 


Sve = i, { da, daz 5(1 — a1 — a2) 


x | — (2m = op) eip(a1a2p? — au? — a2m?) (7-29) 
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The integral over p is only logarithmically divergent due to the extraction of the 
kinematical factors m and p, in contradistinction to the classical linear divergence 
of self-mass (Chap. 1). Were m so large that |p| «<A «m, corresponding to a 
static classical limit, then we would recover this linear divergence in A. 

In order to have a relativistic invariant regularization we may subtract from 
>’ the contribution of a fictitious massive photon with p replaced by A. The 
present value of A is therefore unrelated to the one used in the vacuum polariza- 
tion. Should it be necessary, for large but finite cutoff, to study the A-dependent 
parts, it would be wise to introduce a uniform regularization procedure. We 
therefore use the substitution 


oO co 
| dp eip(orcap? — au? —a2m?) _, | dp (ete (ostap* — aru? —a2m*) _ e~tparA*) 
o Pp o p 


——_ + = O-— = —@ 6 ee 


Figure 7-8 The electron propagator expressed as a power series in the self-energy ©. 
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where all other massive parameters have been neglected as compared to A2 when 
the latter tends to infinity. The identity 


lo @] 
\ dp (eel: +e) = elp(z2+ie)) — =In 2 
| o P . 
allows us to write 


a a {* BA? 
»"(p, A) = \, dB (2m — Bp) In eee (7-30) 
The A dependence affects the coefficients of a linear term in p, the remaining 
expression being finite. 

The situation is slightly complicated by an infrared problem. As long as pi 
remains finite the quadratic form apaaaeing in the logarithm is positive for 
p> “ (m + 1)”. Indeed, for 0 < B < 1 and p” below threshold we have 0 < Bi — p) 
<4and 


(1 — B)m?* + Bu? — BL — B)p? > (1 — B)m? + Bu? — B(L — B)(m + pw? 


= [(1 — B)m — Buy? 
In this region )° will be “real,” that is, i — = 79S "ty? = )* Beyond this 
point )”* will acquire an “imaginary” part corresponding to the process 


Virtual electron > real electron + photon 


This is analogous to the photon case beyond the threshold for pair creation. The 
bothersome point here is that for u? — 0 the threshold coincides with the physical 
mass shell p* = m’. In the real world it will not be possible to isolate in the 
propagator the electron pole from a cut starting at the same location. This 
explains our insistence on keeping p:* > 0. When yp? = 0, one of the zeros of the 
quadratic form reaches the limit 8 = 1 of the integration interval, and expansions 
around the singular point p? = m? are no longer possible. 


S 


A similar but distinct phenomenon occurs also in the photon case at much higher orders (four or 
more loops) corresponding to intermediate states with three, five, ..., photons. It leads to power- 
type corrections to the large-distance behavior of the photon propagator with extremely small 
coefficients. Due to its smaliness this effect has not yet been measured. 


We may, however, consider the limit u? +0 for values p? < m? without 
encountering singularities. This has the virtue of greatly simplifying the integral 


in Eq. (7-30), leading to the result 
2 2 
m 


Yi oe0= obi SS (2m 5) + am[t 4S 


p 
Leumit), (ae = 
~5 05+ oO l(t P+ (7-31) 


As p? tends to m? this expression remains finite but its derivatives become infinite. 
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Insertion of such a self-energy in formula (7-28) obviously shifts the position 
of the propagator pole and modifies its residue. Two equivalent attitudes are 
again possible. Either we call mo (and eg) the parameters occurring in the original 
lagrangian, define m to be the modified position of the pole, and reexpress 
the Green functions in terms of physical mass and coupling constant. Or we 
avoid mentioning bare parameters and introduce counterterms in the lagrangian 
in such a way as to keep the position and residue of this pole unchanged. 

We may do that by expanding ) considered as a function of p(p? = p’) 
around p = m: 


YB, A) = dm(A) — [Zz (A) — 1] — m) + 23 (A) Dal) 
on =) (ee (7-32) 


0 
—[Z7*(A) — 1] = Hy, A)|p=m 


To order f the factor Zz '(A) in front of )',(p) can be ignored and the first 
equation appears as a Taylor expansion since 


0 
LePlrem=0 and YS a(A)|p-n = 0 


However, it is more transparent to interpret this relation as 


p—m— Yix(B) = Z2(A)[P — mo — YP) 


In the Feynman gauge we use (7-30) to extract 6m and Z3'. In fact, for 6m 
we can ignore the infrared infinities and apply Eq. (7-31): 


, Sou NW eal 
dm*(A) = m res (in = + 5) (7-33) 


To extract Z; *(A) — 1 we have to return to (7-30) and keep the contributions 
singular or finite when y* > 0. This yields 


[Zz (A) — 1]* = — are A, 1)| pom 
_ a |! ae 20 peo. 
2n | ae {i F = ma | (1 — BP + Bu2/m? 


Cele A? pw? 9 s 


The renormalized value of Y is obtained by subtracting 6m — (Zz! — 1)(p — m). 
In the range | p* — m*| >» p? we use (7-31) with the result 
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a 5 m? — p? p? p 3 m2 
Samant ee o(-B) 5G 


4 (a2y2 p 2 
+ me in(t - 2) | 4 (pe: m(3 +n) (7-35) 


For p? — m? » ,? the logarithm acquires an imaginary part obtained by analytic 
continuation above the cut starting at (m + y)?:In(1 — p’ jm’) — In(p?/m? — 1)— 
in. The corresponding absorptive part of Yin the limit ju? + 0 is 


abs )'4(p) = 2(1- | — ; (: + =) | 6(p? — m?) (7-36) 


A separate study is required to investigate )’, in the vicinity of p? = m?. By 
insisting that the renormalization be carried on-shell, additional infrared singu- 
larities originally absent from Eq. (7-31) have made their appearance. 

Let us see how the previous results valid in the Feynman gauge are modified 
in an arbitrary gauge. In other words, we want to compute be which we rewrite 
in momentum space as follows: 


a (p) -- | k (k? — pp? + ie)(k? — w/A + ie) 
K(p? — m?) k2(p — m) 
. [ea (p — k® — m? + ie = ‘| (7-37) 


Let I,, Iz, I; stand for the contributions of the three successive terms in the 
integrand. Provided we use a covariant regularization, J3 will vanish, being odd 
in k (it can only be proportional to p but the corresponding integral does not 
depend on p). After regularization we see that ee vanishes for p = m and hence 
does not contribute to 6m. In fact, J, is ultraviolet divergent. However, its derivative 
at p= is infrared singular and jy? will have to be kept finite in order to 
extract its contribution to wave function renormalization, ie. to (Zz! — 1)’. 
Finally, I, has a logarithmic ultraviolet divergence but is infrared convergent. 
For |p? — m?| > py? we find 


ee, te pare lm 
ee Nise a a 


2 D2 2 
-@=m|inA v1 au in(i-2)] (7-38) 


In the vicinity of p? = m? the first bracket [1 + (m?/p7)In (1 — p’/m?)| is to be 
replaced by 


1 In(u?/Am?) — A ln uh) 
2 E 7 «| 
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Consequently, we can split [(1 — AlAly” as the sum of a renormalized part plus 
a term vanishing linearly with p — m in the following form valid for | p? — m*| >» 


ae 
(aa) ole Fe ee Leal 
Se a be 
ae p? — m? m2 ( F) 
o 1 ge 
a A i p’ pe m2 


a3 ) 
~¢-m1- 75" n(1-5) 


2 Dives 2y ae. 
, nu /Am?) — 2 1n (u2/m | (7-39) 
aoe, 
Therefore 
Pe ee (7-40) 
[Z2(A) =) " m? mm OCU. an 


In formulas (7-39) and (7-40) the meaning of the cutoff A is the same as the one 
used for the computation of )"*. 


Considering the combination (252 — 1)°*> we observe that the infrared singular term In (u?/m?) has 
a coefficient proportional to 3 — 1/2. The choice 4 = 3 (Yennie and Fried) eliminates the infrared 
divergences to this order. In the limit p* +0 it corresponds to a photon propagator —i(g,,/k? + 
2 kpk/k*). As a consequence, 5" ” is also free of infrared singularities. 


We have now to compensate the ultraviolet divergences by adding counterterms 
to the lagrangian in order to maintain the proper definition of physical parameters. 
The original lagrangian contains a term —mww. This is replaced by — mo = 
—miy + dm. To order one we therefore introduce 


6L5} = dm (7-41) 
a 3e AY 
with dm = 6m* = 4," (i oa > 5] (7-42) 


Note that 6m is gauge independent, a physically reasonable result, and only 
logarithmically divergent. The added piece to the lagrangian will contribute to 
the proper two-point function a term idm (Fig. 7-9). According to the definition 
(7-25) this will subtract 6m from ) and insure that m is indeed the physical 
mass. In the sense of formal power series, we have to order one (Z2 — 1)! = 
—(Zz' — 1)! Thus the counterterm 


Figure 7-9 Diagrammatic representation of the one-loop counterterms 


— oh + (2 DLW F v — my] to the electron two-point function. 
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s pl] i —<—> - 
2S as =t(2— 1) (; wey — nib) (7-43) 
; gerl A? ie oll teia ee O 
with Zi — = [ny + 3 I eee pe Z 
| (Z2 — 1) # (Fin + ee eee (7-44) 
will compensate to order one the divergent term in p — m. The remaining finite 
propagator [p — m — ¥%°"(p)|~! will have a pole at p= m with unit residue. 


Remarks 


(a) To order hf we cannot distinguish between counterterms of the form dmypw and Z,dmypw since 
(Z2dm)4) = 5m™1 

(b) In the Landau gauge 4 — wo the ultraviolet divergences cancel in Zz! to order one. Unfortunately, 
no unique choice of gauge eliminates all ultraviolet divergences perturbatively. 

(c) All counterterms introduced up to now have a structure similar to the terms in the original 
lagrangian, pointing toward the success of the renormalization program. 


The original observation that the electron self-mass is only logarithmically divergent is due to 
Weisskopf (1939), and the first complete calculation to Karplus and Kroll (1950). 


7-1-3 Vertex Function 


After studying the two-point functions we now face the three-point vertex function 
(eéy). The unique one-particle irreducible diagram pertaining to this process is 
shown in Fig. 7-10 and is given by the expression 


d*k | One (1 —Ak,k. | 


( {ia 7a aN! SE SE | | RRR ol st 
ee) = (—Ie) | oe pie ( — p+ i(k? — P+ ie) 


Ved ai ee 7 7-4§ 
«(1 pein ie? poh —m + ie” 2) 
Using the same notation, the zeroth-order value is y“. Accordingly we shall write 
the complete one-particle irreducible three-point function 


A.D’, p) = yu + TP’, P) (7-46) 
with Eq. (7-45) giving the first nontrivial contribution to I’,. 


p Figure 7-10 Vertex diagram to the one-loop order. 
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Ze : 
A 


— Se ae 
photon line in an electron self- 


ieee PS —-- / energy diagram leading to the Ward 
identity. 


If we recall the introduction of the electromagnetic coupling through the 
minimal substitution ~~ p — eA we may expect a close relationship between 
the electron propagator and the vertex function. This is indeed the case and is 
embodied in an identity due to Ward. 

Consider the insertion of an external photon line of zero momentum in an 
electron self-energy diagram, in all possible manners, on the internal charged 
particle propagators (Fig. 7-11). The term of zeroth order yields i(p — m)— 
—ieA, while the one-loop contribution as depicted on Fig. 7-11 leads to 
—ie>Y(p) — —ieA"T!'\(p, p), and so on. The graphical correspondence may be 
interpreted as follows. In each internal charged particle propagator of a self- 
energy diagram we substitute p — eA for the running momentum p with A, a 
fixed constant four-vector. We then expand in powers of eA and extract the 
coefficient of the linear term. This is T’,(p, p). 


For instance, if we apply the procedure to the one-loop contribution to )'(p) we find 


@ | f dt 1 Doo (1 — AMegke 
LP) > sean) | a E 425 te (Re a) 


1 
x y? ——__—_____—______ y” 
, p= ¥- chon oe A=0 
Since 
1 1 1 1 

Mn =. SS Le SS eA 34 ee 

B-k-eA—m+ie p-—k-m+ie p—-K-m+ie ~ p—K-—eA—m+ie 
after differentiation, and setting A equal to zero, we recover an expression for I“(p, p) consistent 
with Eq. (7-45). 


We could ask whether the above operation yields all vertex diagrams. The 
answer is obviously “no.” Consider in a self-energy diagram an internal closed 
fermion loop. According to Furry’s theorem an even number of photon lines are 
attached to this loop (if we agree to cut through each internal photon line starting 
and ending on this loop). Again according to Furry’s theorem, it is therefore 
impossible from the above procedure to find a vertex contribution where an 
internal fermion loop would be attached to the rest of the diagram by an odd 
number of photon lines, such as depicted on the example of Fig. 7-12. This 
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Figure 7-12 Example of a diagram that cannot be obtained from 


! 


7) Pp 2(p) by the procedure described in the text. 


means that we obtain only those diagrams where the external photon is attached 
to the fermionic line that carries the charge flow originating in the external line. 
This may at first seem troublesome if we are to find a relation between » and i 
Fortunately, the sum of all diagrams where the external line is attached in all 
possible ways to an internal electron loop vanishes when evaluated at zero 
momentum transfer g. This we can see after a suitable Pauli-Villars regularization 
of the loop by isolating a factor of the form 


y d*p tr : ) : + : 
- Sime fom ep d—m+ee 


1 
Ptdtds—m+tie i) 


where the sum of all momenta entering the loop q2 + q3 +°*: + dap vanishes. 
As before, 


x V3 


1 1 7) 1 

p—-m+ic “J—-m+ie Op’ p—mtie 
Summing over all possible insertions of the external photon for fixed values of 
the internal photon momenta we get a total derivative as an integrand. The 
integral vanishes provided it is regularized in a gauge invariant way. We con- 
clude that the prescription of attaching an external photon may be restricted to 
the distinguished set of electron propagators carrying the external charge flow. 
The same set may be seen to carry the electron momentum flow and the pre- 
scription is then equivalent to a derivative with respect to this momentum. This 
yields the Ward identity as 


= = i (p) (7-474) 


If we include the zeroth-order contribution this reads 


Auld P) = Yn + Tul, ) = i [b-m-Y@)] (7-476) 


In the next chapter this identity will be generalized for nonvanishing momen- 
tum transfer in the form due to Takahashi: 


(p' — p)“A,(p', p) = [p’ — m — Yip’) — P- m — D0) (7-48) 
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and its relation with current conservation will be discussed. We can readily show 
that (7-47u) follows from (7-48). The sketch of a proof given above obviously 
requires some care. In particular, a consistent gauge-invariant regularization is 
needed. In fact one of the goals of the renormalization of quantum electro- 
dynamics is precisely to transform the formal Ward-Takahashi identities into 
relations among renormalized finite quantities. 

We return to the one-loop calculation and drop the index 1. From Eq. (7-45), 
T(p’, p) — T(p, p) is finite, and P'(p, p) is known from the Ward identity in terms 
of )'(p). This means that 


i= = if 50) + Gi Le. P) 


zs! — 1) 4+ Teepe 2a = rp) (7-49a) 


We define the (infinite) vertex renormalization constant Z, through 
T,(p', p) = y(Zi* — 1) + Zi TEP p) (7-49b) 


Again the factor Z;/ in front of the renormalized vertex correction % can be 
replaced by unity, when computing to first order, and IR(p’, p) is normalized 
by requiring that on-shell (p'’* = p? = m”) and for zero momentum transfer, its 
matrix element between Dirac spinors vanishes: 


u(p')'(p’, p)u(p) |t=p2=m =0 (7-50) 


Clearly T’,,(p’, p) — T,(p, p) vanishes when p = p’ and so does the matrix element of 
(0/dp") Y*(p), since this operator is proportional to (p — m). Comparing (7-49a) 
and (7-49b), we reach the conclusion that 


Z2 = Zy (7-S5la) 


and rae zine hee) - af Rp) (7-516) 


The Ward identity remains valid in a theory with massive photons. We leave it to the reader to 
be convinced that a one-loop counterterm of the form 
bL Gay = —e(Z, — IAW (7-52) 


will indeed cancel the vertex logarithmic divergence to this order. 


What remains to be done is to compute 'R(p’, p). We shall satisfy ourselves 
with the case where p and p’ are on-shell and T, is to be considered as sand- 
wiched between Dirac spinors as u(p’)I’,(p’, p)u(p). We shall, however, not write the 
spinors explicitly but we freely use the Gordon relation 

= , 1 — iF , . 
iu(p')ypu(p) = =~ ulP')L(P + P')o + io pr(p' — p)’]u(p) (7-53) 
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The quantity that we intend to calculate has a gauge-independent finite part. 
Indeed, in Eq. (7-45) the term proportional to (1 — 4) involves an integral over 
1 1 
7 —h-m+iep—-h-mt ne 
We can replace k to the right by k — p + m and by k — p’ + m to the left, and 
thus find a quantity independent of p and p’ which is thus absorbed in (Zy} — 1) 


by virtue of the normalization condition. As far as (Z;! — 1) is concerned, we 
know that it depends on the choice of gauge. We therefore set 1 = 1 to write 


d*k 1 p—Kim 
; a ee ae 
l’,(p’, p) i( ie) |& ke =r # ie Yo (pay? en? + ie Yp 


p—kim , 


Me 
(p — ky —m? + ie? 


The numerator in this expression may be written using the mass shell condition as 
; k? ; 
4{,,) 0 — k)-(p—k)— c| + (p' + p—k)k — mi} 


An auxiliary integral is 


d*k ik>z 1 
(27)* (kw? + ike? — 2p’: k + ie)(k? — 2p-k + ie) 
1 © dates 2/2 — ap’ — asp)" 
= aaa \ 1 442 dX3 . exp {aif au’ ie (z/ 2P 3 i 


o (a, + a2 + 43 Oh) ap ky ae) 


The introduction of a factor e*? in the integrand allows us to obtain the required 
expression in the numerator of I, by differentiation. After symmetrization in 
a2 and a3 we obtain the following representation: 


a (° da, daz daz (p' +p) i 
p=) + 2+ a3)p°p’ — (a2 + #3) 5 — +5 
Y,(p’, p) rm | («) lems a3)° ac G2 + a3)p°p’ — (a2 3) 5) 5) 
I : ' i. at (a + 2.) 

_— ————.- + 043)? — (Pp +p), — 

3 2(a%; + &2 + 43) beioketa “+ os) — if i (P+ Phe O, + a2 + 43 


i 2 ia") 2 
| ode (0 + O nm? — aif +an + ashe] 
x exp | [eat (a2 3) 1 


where we have set q = p’ — p. When we attempt to perform the integral over the 
homogeneity scale of the a; parameters, we encounter the expected logarithmic 
singularity which we subtract taking into account the normalization condition 
(7-50). We observe that q? <0 if p and p’ lie on the mass shell. After a Wick 
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rotation a;— a,/i we obtain 


oO 3 
rR(p’, p) = — = \, da, daz du 6 (: _ 2 a) 


. ee — Galo a%3)(2m? — q’/2) + 4[(a2 + 03)?m? — a2%3q7]} 
m? (a2 + 03)? + o&1H? — 203g" 


(p’ + P)p%1(%2 + a3) 
m? (a2 + a3)? + op? — 02. 43q7 


+ (m/2) 


1 ; 
ate 5 Ye In [m? (a2 + a3)? + ap? — arsae?] —(p' = p,q=9) 


since from (7-51b) the renormalization of the on-shell vertex function to order one 
amounts to a subtraction at q = 0. We isolate the form factors F,(q*) and F(q’) 
defined as 


i 
pep) YF i(q) + ae 6 pvq’ F2(q7) (7-54) 


We are, of course, only computing the contributions of order « to these form 
factors. The term of order zero adds one to F,. We find 


(oe) se 3 
F,(q?) = — = { da, da daz a(1 _ 2 a] 


$ m? — q?/2 — (a2 + a3)(2m? — q?/2) + m?[(a2 + 43) — H(a2 + 43)7] — a203(q?/2) 
m? (a2 + 03)? + ay? — a203q7 


! 2 
2 Tae 1 2 3 , J m?(a2 ae a3)? ae op? a 02 03.q7 S 


The momentum transfer can be parametrized in terms of an hyperbolic angle 6 
such that 


2 


p:p) =m’ cosh@ = q? = — 4m? sinh? ; (7-57) 


The easiest calculation is that of the magnetic form factor F,. It does not involve 
any infrared divergence to this order so we can set p? = 0. Integration over 
yields 


oa 2 + %3 — (a2 + a3)? 
F,(q7) =— deada, 00 — o, — 3) eS eee 
(7) a gee © ‘ 2) Saga aE 


Introducing a new homogeneity parameter, this can be further reduced to 
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ae 1 1 
Peg \)=— | @ondaxd(l — 0. — = 
2(q ) Tt \ AX AX; ( A a3) a2 +4 ae + 20703 cosh 0 \, dx (1 x) 


dB 
1 4p 1 
o B {1+ [0 — pple {1 (i —pyepe 


The last integral can easily be performed using (1 — f)/f as an integration variable 
and the result is 


ree te 1 
+ B? + (1 — B)? + 2B(1 — B) cosh 8 


a iG 


F. 
2(q°) a sinh 6 


(7-58) 
In particular, when q? > 0, 0 > 0 and 


F,(0) = (7-59) 


= 
We turn to the calculation of F,, paying attention to infrared divergences. After 
integration over x, we find that F; may be expressed as the sum of four terms: 

4 

F,(q’) = )° 7; + constant 

j 
and the constant is adjusted so that F,(0) = 0. The various contributions in the 
limit of small yz? are as follows: 


Fa 2% “4 ae cosh 0 
oN se 0 ‘ie a3 + a3 + 2203 cosh 6 + (u?/m7)(1 — a2 — «3) 
of a 1 
= ~Z cosh 6 | ip | dx 
— Gt 0 0) 
oe 


x?[B? + (1 — B)? + 2B(1 — B) cosh 6] + (u?/m?)(1 — x) 


OF erp “a In [B? + (1 — B)? + 2B(1 — f) cosh 6] — In(u?/m7) 
oS P B? + (1 — B)? + 2B(1 — B) cosh 8 


Changing variables aes to 


tanh @ 


oo? ~ ean (6/2) 


reduces the integral to 


6/2 
F =%coth on & — cath 6 | do o tanh @ 
Tt 
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The other pieces of the puzzle are 


i 1-—a 
a O2 + 43 od 
FZ =—cosh 6 da daz +5 <5 = = 8 coth 6 
tT 0 0 a% +03 +20,03;cosh@ 


ae 2 - : - 0b 3 ate 6 1) 
PS Man) 0 “ 0 3 2 + a8 + 2a03 cosh 2n sinh 6 


ao 1 1—a ; ae 
te = | dos | da3 In| (x2 + x3)? + 4%2%3 sinh 5 


0 0 


ek. 0/2 
~ Qn tanh (6/2) 


Finally, adjusting the normalization, we obtain the complete expression 


+ constant 


OL m af 6 @) 
kulg = (mn + 1) @coth @—1)—2cothé dy » tanh » — — tanh = 


: 4 Ps 
(7-60) 
In the vicinity of g? = 0 we have q?/m? ~ —6? so that, to leading order in q’, 
2 
aq m 3 
Fil@ eo = Seen (in Lb aes ;) (7-61) 


This means that the complete expression of the renormalized vertex function 
evaluated between spinors takes the form 


(p+P')o ag? ea: i om 
ep EG a == || aes ee) | Oa i 
2m 32m? a u «8 2m ea 2n me 


By analytic continuation in the complex variable @ we obtain I’? for positive 
values of q’. Setting, for instance, = iy we reach the expression valid for q? 
between 0 and 4m’, where F, and F, continue to be real to this order: 


gq? = 4m? sin? e O0<W<zx (7-63) 
where 
F, =2}(n-4 4 1)iyreot y = + 2eoty | dp prano + Stan S| 
1 m 0 2 2 
Par . (7-64) 
2" In sin W 


A useful exercise is to continue these expressions beyond 4m? and give an interpretation of the 
imaginary part of the form factors. What is the behavior for large |q?|? 
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7-1-4 Summary 


Let us present a summary of what has been achieved by eliminating ultraviolet 
divergences. Up to now we have defined three divergent constants: 


‘I. Z, = Z2, gauge dependent as well as infrared divergent, but equal by virtue 
of the Ward identity. 

2. Z3 and om both gauge independent and infrared finite. We have computed 
Z,;—1=2Z, -1,Z3— 1, and 6m to order h in such a way that the lagrangian 


1 ri A j <> = =: 
B= - GP +S MS (0-AP + TF U - my — ev Ay 


1, ee jaa ae Wn 
ag 5 — We + Za 0(5u Umi) + 26m 


—e(Z, — 1A (7-65) 


will produce finite renormalized two- and three-point functions, up to the same 
order. We shall see that the same holds for higher Green functions. 


The photon “mass term” has been denoted (ji7/2)A?. As we noted, the true position of the pole in 
the massive theory is displaced by radiative corrections by an amount related to the wave function 
renormalization constant Z3, since the vacuum polarization tensor has only one overall divergence. 
It is simpler to define the corresponding dimensional parameter as (minus) the coefficient of the term 
Joo in the inverse propagator at zero momentum: 


7? Ho Za (7-66) 


Here ji stands for the (infinite) bare mass. Therefore, fi? A? = Z3u3A?. In a massive vector boson 
theory the real mass p would be related to fi by a finite renormalization factor. 


It is now necessary to convince ourselves that higher Green functions are 
finite, since we have now exhausted our freedom of choice by selecting the 
measurable mass, coupling constant, and pole residues. This analysis is of interest 
because there still exists a potentially dangerous amplitude for photon-photon 
scattering. If we were dealing with spin-zero charged bosons, a new physical 
constant, apparently absent at the level of Born diagrams, would emerge from 
the needs of renormalization. In this case we have to give the normalization 
of the charged particle elastic scattering amplitude. 

The mechanism responsible for ultraviolet divergences in one-loop integrals 
may be characterized as follows. If k denotes the momentum flowing through the 
loop we are interested in the large-k behavior of the integrand in the Feynman 
integral. Each fermion propagator contributes a factor |k|~* and each boson 
propagator a factor |k|~ *. Vertices with a y, coupling are k independent. Proper 
functions are classified according to the number of fermion external lines zero, 
two, four, etc. Typical diagrams are shown in Fig. 7-13. 
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Figure 7-13 Examples of one-loop diagrams with zero, two, four, .. ., external fermion lines. 


If A stands for some ultraviolet cutoff, the behavior of the integral will be 


Ne tae 4—I,—2Ip>0 
In A 4—I,—2Ig=0 


with I-(Is) denoting the number of fermion (boson) lines in the loop. The integral 
will be superficially divergent if 


ol(Gy=4 = 1p —eineaO (7-67) 


where «(G) is called the superficial degree of divergence. In electrodynamics the 
kinematics somewhat complicates the matter. We have already encountered 
examples such as in the case of vacuum polarization (Ig = 0, Ir = 2) where the 
above counting seemed to indicate a quadratic divergence w = 2, but current 
conservation allowed the extraction of a factor (k*g°’ — k°k’) leaving only a 
logarithmic divergence. Similarly for the electron self-energy w = 1, but again the 
fact that } was found proportional to the dimensional quantities m and p 
reduced the degree of divergence by one unit. For the vertex function the counting 
is correct and w= 0. But even in this case only one form factor was found 
divergent. We may express w(G) in terms of the (even) number of external 
fermionic lines Ey and of the number Ex of bosonic external lines by using the 
fact that two fermion lines and one boson line meet at each vertex. Along the 
loop the number of vertices V is equal to the number of internal propagators 


V= Ip + Ip 
and according to the previous argument 
2V=2Ip+ Er V =2Ip+ Ep 
since each propagator meets two vertices. Eliminating V, Ip, and Ig we find 
w=4-3E,— Eps (7-68) 


Therefore we derive the following table: 
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Superficial degree Effective degree 


Er Ep of divergence of divergence 

0 2 ~)- 2 0 

0 3. Excluded by charge conjugation invariance 

0 4 oO 0 (7-69) 
8: lle ae 0 

plot — 0 0 


As the number of external lines is increased the integral becomes more convergent 
and table (7-69) shows that we have exhausted all possibilities except for photon- 
photon scattering. However, the latter is in fact convergent, again due to gauge 
invariance, which dictates some kinematical structure leaving a well-behaved 
integral. This follows from our earlier discussion of the Euler-Heisenberg effective 
lagrangian (Sec. 3-3-4) and will be discussed in detail below (Sec. 7-3-1). 

To order h we now have a finite predictive theory satisfying perturbatively 
the general principles of unitarity and causality, since the counterterms amount 
to subtractions in dispersion relations, i.e., they do not modify the analyticity 
properties nor the discontinuity rules. Another confirmation stems from the fact 
that we are using a local hermitian lagrangian. 

Let us now return to Eq. (7-65) and reorganize the various terms as 


1 Pets _ 
L= ae + 5 HOZ3A” F(a? + Z| 50 Fy — Om imi 


— Z,ewAw (7-70) 


This expression leads to a natural interpretation. The fields y, w, and A occurring 
here are renormalized fields corresponding to propagators with residue equal to 
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unity at the one-particle pole. Define unrenormalized fields Wo, Wo, and Ap and 
cutoff-dependent bare parameters Ld, Mo, €o according to 


Wo = Z2'7(A)W = mo = m— dm(A) 
Wo= Zi7(NW we = Z3 (A)? (7-71) 
Ao = Z4/7(A)A €o = Z1(A)Zz '(A)Z3 1/2 (A)e = Z3 */7(A)e 
Then the lagrangian (7-70) expressed in terms of bare quantities is similar to 
the original lagrangian 


Se ee a a 
eee = 525 (A)(0 > Ao) + 5 Wo Wo — MoWoWo — CoWoAoWo 


4 2 
(7292) 


Not only do finitely many types of counterterms suffice to render the perturbative 
series term-by-term convergent (a property which will be preserved to all orders in 
h, as we shall see) but the very structure of the theory is preserved by renormaliza- 
tion. 

It is worth emphasizing the role of the Ward identity, a result of which is 


€9 Ang = CA (7-73) 


giving a meaning to the scale of the electromagnetic interaction and preserving 
the meaning of minimal coupling. 

Note also that the A? and (@- A)? terms were not affected by counterterms, 
a special property of electrodynamics. In the same way that we introduced a 
bare “mass” yo we can define a bare parameter A, through 


Ag = Z3*(A)A 


The lagrangian (7-72) gives rise to renormalized Green functions defined as 
follows: 


Go(P1,--+5 Pans K1,--+» ki Lo, Mo, €o, Ao, A) 
= Z3(A)Z3° (A) Gr(P1, +++, Dan ki, sees ki; HL, mM, @, A) (7-74) 


Here pj, ..., Pan (ki,.--,k,) Stand for the external fermion (photon) momenta. Again 
this holds perturbatively in h, and up to now we may only assert this up to order 
h'. The relations (7-71) and (7-74) justify the denomination of multiplicative 
renormalization. 

The wave function renormalization constants Z}/? and Z}/? appear to be 
infinite when expanded perturbatively. This shows, at least in this sense, that one 
of the working hypotheses of this approach, namely, the canonical equivalence of 
the free and interacting picture, is not mathematically well founded. We have 
succeeded, however, in overcoming this difficulty, and we shall discover later 
on that there is some interesting physical meaning in these apparently infinite 
quantities. 


The extension of this program to all orders will be the subject of Chap. 8. 
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7-2 RADIATIVE CORRECTIONS TO THE INTERACTION 
WITH AN EXTERNAL FIELD 


The remainder of this chapter deals with examples of radiative corrections. One 
of the issues will be to make sense out of the infrared divergences arising from 
the long-range electromagnetic forces. This survey is only indicative and without 
any pretense to completeness. 


7-2-1 Effective Interaction and Anomalous Magnetic Moment 


In Chaps. 2 and 4 we have already presented various aspects of the interaction 
of charged particles with c-number external fields. Here we want to discuss 
quantum corrections generated when we substitute to the quantum field A, the 
sum A, + A,, where A, is the classical field. 

To first order in A, we derive an effective interaction represented by the 
first three diagrams of Fig. 7-14 including one-loop quantum corrections. This 
means that the elementary interaction ey,A? has been replaced by 


Aap €Any,+ 1, + OG ya) (7-75) 


where I and @ have been computed in the preceding section and G is the free 
photon propagator. We have assumed the electrons on mass shell, disregarding 
therefore the self-energy insertions on the external lines which are assumed to be 
absorbed in the correct definition of the mass and proper normalization of the 
states. 

Atlow momentum transfer q = p’ — p, keeping only dominant terms as p > 0, 


we obtain 
5 
me ag m2! \\ 1 1-16 
aa ab Ppa lite, SS bien 74 (7-76) 


When deriving (7-76) the gauge-dependent terms of G’’ proportional to q’q° 
have not contributed when acting on the vacuum polarization. Also a term in 
¢ = p — phas disappeared when evaluated on the mass shell. Consequently, the 
remaining expression remains gauge invariant in the sense that if A? is replaced 
by A’ + q°f(q) the added term vanishes on the mass shell. . 
The quasistatic limit q > 0 allows a simple interpretation. In configuration 
space the momentum transfer q can be replaced by the derivative operator id 


A, AN 
=p'-p } | 


Figure 7-14 Lowest-order contributions to the interaction with an external field. 


348 QUANTUM FIELD THEORY 
acting on A,. Using Gordon’s identity we obtain the two equivalent expressions 


of the interaction hamiltonian of a slowly varying (in space and time) external 
field with a charged spinning particle: 


i ; 
AHiyes aC « fotcayovt9] —— (i an Ln Jat 


— ae W(x jovavcuie Atta 


a 
oF 
a 
se [ae ae Biss 20o| so (n” - 1) [aren 


+(1 af a) Zo Ux)ovp WO) Fe 00} (7-77) 


In the second form the first term introduces a convective current analogous to 
the one pertaining to scalar particles, and its interaction with A, is affected by 
infrared divergences. The second term has a simple interpretation. In the limit 
of a constant F, it reduces to an effective magnetic dipole energy. Let us assume 
that F represents a constant magnetic field F1? = — B°, F?? = — B’, F*! = —B’, 
012 = 63, etc. It then reads 


—B-p=-B:- [s (: + £2 [ex W(x) 2 Ho | (7-78) 
2m an 2 


When discussing the Dirac equation we found a gyromagnetic ratio 2 leading 
to a magnetic dipole moment 


We see that to order hf quantum corrections modify this gyromagnetic ratio 
written in the form g = 2(1 + a), where a is called the anomaly, by a= x/2z. 
This result first derived by Schwinger in 1948 has been confirmed by numerous 
accurate experiments, which enable us to test higher-order corrections. 


The electron anomaly is mass independent to lowest order. Therefore, the same result applies as 
well to any elementary spin 3 particle such as the muon. A complete theory to higher orders 
requires the consideration of all charged particles interacting with the electromagnetic field. This 
leads to a difference in these anomalies arising from mass differences. At the present time, calculations 
have been pushed to third order (three loops). They read 


att lo a \? ae 
at) = oa 0.328 478 445 = + 1.183(11)[—] = 1159 652 359(282) x 1071 
7 
(Numbers in parentheses indicate uncertainties.) The latest measurements are of a similar accuracy 
and are likely to be improved in the near future: 
aS’? = 1 159 652 410 (200) x 107? 


To cope with such an improvement the theoretical program involves obtaining the (a/z)* term with 
greater accuracy and evaluating the next order [since («/z)* ~ 29 x 107 !?] represented by as many 
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as 891 Feynman diagrams! Minor contributions involve the insertions of a muon-loop vacuum 
polarization correction in the virtual photon propagator (~2 x 107 !2) together with hadronic (~ 1.4 x 
10° '*) or weak (~ 0.05 x 10~!2) contributions. The measurement of the electron anomaly is likely 
to become in the near future one of the most precise means of obtaining an accurate value of the 
fine structure constant «, by a unique combination of theoretical and experimental skills. 

In the case of the muon anomaly the uncertainties on hadronic corrections set a limit on the 
accuracy of theoretical predictions. The most recent measurement gives 

a,” = 1 165 922 (9) x 107° 

The pure electrodynamic contributions are 


3 


1 2 4 
avd — — ~ 4 0.765 782 (=) + 24.45 (0.06) (*) + 128.3 (71.4) (:) 
22 t T 


7 


= 1 165 851.8 (2.4) x 10-9 
where the x* term is an estimate using the large ratio m,/m., and the hadronic correction is of order 
qhadt _ 66.7 (9.4) x 1079 
The theory is in reasonable agreement with experiment with a prediction 
at® — 1 165 919 (10) x 107° 


Weak interaction effects are expected to contribute at the level of a¥®** ~ 2 x 1079. 


The first term in the effective hamiltonian (7-77) appears troublesome in view 
of its infrared singularity. This arose from our insistence on having an isolated 
pole in the corresponding Green function. However, such a state can never be 
separated from those including an arbitrary number of soft photons. The latter 
are always excited as soon as a charged particle suffers a change in velocity, 
no matter how small. Therefore we have to include processes involving real 
emission or absorption of soft photons in the external field as soon as we discuss 
the virtual electromagnetic effects embodied in Eq. (7-77). The fictitious photon 
mass disappears from the final answer to a physically correct question, being 
replaced by the experimental resolution. 


7-2-2 Radiative Corrections to Coulomb Scattering 


As an example let us consider Coulomb scattering on a nucleus of small Z so that 
Za may still be considered as a small parameter. Atomic corrections (a screening 
effect in a first approximation) will be neglected. Using the effective interaction 
(7-75) let us compute all corrections up to order «° to the Mott cross section 
[Eq. (2-130)] for unpolarized electrons. If E and p stand for the energy and 
magnitude of the electron three-momentum and f denotes its velocity B = p/E, 
we recall that the scattering cross section at angle 9 was given by 


do Lo ‘ae 5) 
La. ae a (7-79) 
(=) 4p? B? sin* aa ( ser 


Since the first corrections will be of order Z*«* it will be mandatory to include 
the second Born approximation whose interference with the leading term will 
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produce a contribution of order Z*a?. This is comparable with the effect of vertex 
and vacuum polarization renormalization provided Z is small enough. The interest 
of pushing calculations that far is that it also enables us to discuss another aspect 
of infrared singularities related to the long-range character of the Coulomb forces. 
The latter induces an infinite phase shift on the scattered plane waves. To prevent 
it we may introduce a screening factor which in a consistent theory would be 
related to the fictitious photon mass u. In order to show that this type of infrared 
divergence is cancelled in the cross section we shall distinguish it by introducing 
an independent screening length o and by studying the limit o — 0 separately. 

The diagrams contributing to the scattering amplitude are shown in Fig. 7-14 
with the external potential A, equal to the screened Coulomb interaction 


Zee d*q 5(qo) ; 
0 iqg> x 
A,=0 A; (x) = — = Ze Fes ) a e 


4nr 
The first three diagrams (1), (2), and (3) of Fig. 7-14 yield 
° 2! , 1 _ , 0 a 2) 
$123 = —iZe Cx)o(po — Po) = — a 1+- 1 + In — (2g coth 29 — 1) 
q-—o T m 
a 1 
—2coth29 | dg’ o’ tanh g’ — 5 gy tanh @ 
0 


coth? @ 1 q“« @ 
+ (: — 3 eo coth g — 1)+ ;| ~ Dai DO u(p) (7-80) 


with p: p’ = m? cosh 29 and q = p’ — p. The fourth diagram adds the quantity 


d*k _ | 2n6(po — ko) i 2n6(k° — p°) 
a 2 eri Ne 0 a ee 
See (a |< ww) —k)-—o its k—m-+ ie ae (k — p)?? —o? ulp) 


From the kinematical constraints, p° = p° = E and |p| = |p’]. Set 


fae: | fe a. 
: [(p —k) + o7][(p_— ky? +o]? — Kk? + is) 
1 k 
ee i ee 
oO ae | [ek + op +o] — +) 
Then S4 takes the form 


Zao 


S4= —2i 276(Po — po)u(p’)[mU, — Iz) + yE(, + 12)]u(p) (7-81) 


7 
We are, of course, only interested in the small o limits of I, and J,. Feynman’s 
identity 


1 --3 [a 1 
ote wal” pee 
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enables us to write 


1 
ee 3 
: i, 4B MoM = [a ae es ie)[(k — P)? + M?] 


where M? = a? + 4B(1 — f)p? sin? (0/2) and P = fp + (1 — )p’. Similarly, 


1 , deh Lee 3h 1 
50+ poh = |) ‘8 (555 seam) | GEO 


The intermediate integral is readily performed with the result that 
Pk 1 _ int PoP +iM 
(p? — k? + ie)[(k —P)?+M?] P  p+P+iM 


with an adequate choice of branch cuts to define the logarithm. The corresponding 
expressions for J, and J, in the limit o + 0 are 


2 . 
7 - 2p sin (6/2) 


ft = 3553 sin? (6/2) F eis 
1 Tt 1 1 2p sin (6/2) a 
1a = ogo (0/2) 5 i ~ sin ai . il a 7) al sal 
(7-83) 


Inserting these expressions in Eq. (7-81) we obtain the last matrix element. The 
unpolarized cross section is therefore given by 


da 4Z?«a?m? 1 5 
Se eee aly’ 7-84 
10 lal? 5 Lipor | CP’) Tu(p) | (7-84) 
with T standing for the coefficient of (iZe?/|q|?) 276(p° — p’°) in the sum S123 + 
Sa. that is, ‘ 
d 


T=y+ A+ y= -B+e 


Q 
A= as i(: + In “V0 coth 2g — 1) — 2 coth 29 \ dg’ ¢’ tanh g’ — = tanh @ 
T m 0 


1 Z 
ae i eemaxe (yg coth g — 1)+- = asa) (7-85) 
3 9 2n 
a Q 
F sinh 2 
C= — <5 mlql? (isis) 


To be consistent, in Eq. (7-84) we keep only terms up to order a. Note that 
the only complex quantities are J, and I,. The sum over polarizations yields 
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ay ptm —p +m 
m? Voor | HP) Tulp)|” =m? tr (x er if ie 
= (1 +2 Re A)(2E?)[1 — f? sin? (0/2)] 
+ 2B(2E*)f? sin? (6/2) + 2 Re C (2mE) + O(a’) 


All the dependence on the screening factor o which was occurring in the imaginary 
parts of A and C has disappeared as expected, and the elastic cross section 
including the first nontrivial radiative corrections reads 


(<) = (<5) ‘ ae cai ( + In *\ 30 coth 2g — 1) 
dQ elastic dQ Mott " a 
a ” 
— 2 coth 20 | dg’ g' tanh g’ — > tanh @ 
0 


coth? 1 Q B? sin? (6/2) | 
7 (: a ) @coth o - Det 5 — Sane 1 eae) 


+ Zon (7-86) 


B sin (6/2)[1 — sin aaa 
1 — f? sin? (6/2) 


We recall that E and p are the electron energy and momentum, f = p/E its 
velocity, @ the scattering angle, and cosh 2g = p:p’/m’, sinh o = p/m sin (6/2), 
and |q|? = 4m? sinh? o = 4p” sin? (0/2). 

In Eq. (7-86) we are left with the genuine infrared divergence. To obtain a 
sensible result we will have to compute and add the cross section for emitting 
soft photons. 


7-2-3 Soft Bremsstrahlung 


In Sec. 5-2-4 we have already encountered electron bremsstrahlung in a Coulomb 
field. We obtained the Bethe-Heitler formula exhibiting a typical dw/w spectrum, 
where w is the photon energy or, what amounts to the same, the electron energy 
loss. If we consider a typical energy resolution AE we would expect by integration 
a probability of radiation of the order of Z*a? In (AE/u), which might well com- 
pensate the analogous term in the elastic cross section. Our present objective is 
hence to integrate the Bethe-Heitler expression on photon variables in a range 
which we define as w < AE. We shall assume that uw «AE «E. An objection 
may be raised to the effect that in Chap. 5 we practically assumed that p ~ 0. 
In fact, the only dangerous term in the limit u/E «AE/E —0 arises from the 
propagators. We may therefore use the formula (5-151) valid for vanishing photon 
momentum k. If this contribution is denoted [(do/dQ)(AE)] we have 


inelastic? 


do do ak. | 2p m? m? 
rae (AN) =| Sa human © mae aa 0) 
dQ, inelastic dQ] on Josae 2@(27) k-pk-p’  (k-p) (k- p’) 


(7-87) 
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The resolution could vary with the kinematical conditions, which would 
require a more careful evaluation using the complete Bethe-Heitler expression. 
Here we only want to demonstrate the basic compensation mechanism. In formula 
(7-87) we have in principle E = E’ + w, but we are only looking for the dominant 
emer in AE. Therefore, whenever allowed we shall take the limit E’ > E, 
|p = (k* + y’)'?. Let B denote the coefficient of (do/dQ) 
Using er S auntie we write 


| d?k | 2p-p’ m?* m? 

An? J-1  Josaz 20 [(k-P?  (k-p)? (kp? 

where P = 3[(p + p’) + <(p' — p)]. Taking into account that |k| < [(AE)? — p27}! 
~ AE we find 


er 


ok an d|k |k|? 
BeAr w(k+ P)? p (k? 7 fe) oe [k?(P2 = P?) ma qos ] 


arctan (AE/j:) sin yw tan 
=4n pl. plane) 
, P3 — P* sin? yp 
_ 4 2AE Po oe 


Inserting this in the expression of B with the notation p= |p|, p’ =|p’|, and 
, which we hope will not confuse the reader, we are left with one- 
dimensional integrals over z: 


o 2AE poe 1 E a+ iB 2 + p’ 
ee ine | 14 22 ——— In 
i 24 in | sae | ara 5B 2p Bes 


pee | es 1 Po(z) , Po(z) + eet 
Po(z)* 


ai —P(z? Plz) Po(z) — P(2) 


We observe that Po(z)* — P(z)? = m?(cosh? @ — z? sinh? @), using our previous 
notations. Consequently, 


BER “- i =scosh' 2 + : = 29 coth 2 
— ss Ga a 6 ? cosh? Oz smh? @ 

For the last integral we introduce the variable € = P(z)/fPo(z) and take the limit 
E’ > E, p' > p up to corrections of order AE/E. We see that € is also equal to 
[cos? (6/2) + z? sin? (0/2)]*/? with 


‘ 


pp’ | a 1 Po(z) ,_ Po(z) + P(z) 


, ero) Fol RO=ro 


= cosh 2 B sin (6/2) <a oT = ies |e fet cos2 (6/2)] 1/2 oa = (22 
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Putting everything together, the soft photon emission cross section reads 


do da 20 2AE 1 + 1+8 
—. 2 th 2g — 1) ln — + sin 
er ap| ( a) z ere i H 2p Tae 


dQ 
1 1-f? f? er 
= 7 008h 20 FOP) Foon PELE — cos OP "1 = BE 
(7-88) 


7-2-4 Finite Inclusive Cross Section 


The physically measurable quantity, which we denote by (da/dQ)(AE) is the sum 
of elastic and inelastic contributions 


do 
G (Ae) = ( «| +| Ze (7-89) 
dQ elastic dQ inelastic 


Both terms are evaluated to the same order «° and are given respectively by 
Eqs. (7-86) and (7-88). Schwinger, who first computed these corrections, has 
introduced the notations dr and 6, for the relative corrections arising from virtual 
or real photon emission and second-order Born approximation. In this way 


do 
mu» - ( 


When adding the results of Eqs. (7-86) and (7-88) the infrared cutoff drops out 
altogether, as expected. The fictitious mass yu has only been useful to give a 
meaning to the intermediate steps of the calculation and the 1 +0 limit is now 
perfectly legitimate when care has been taken to allow for the experimental 
resolution AE. The final result is, of course, sensitive to AE. 

When adding the two contributions we may use the following identity which 
can be obtained by comparing the changes of variables performed when integrating 
(do/dQ) elastic aNd [(do/dQ)(AE)]inetastic: Recalling that sinh o = [BA1 — p?)'?] x 
sin (0/2), we have 


) (1 — dp + 53) (7-90) 
Mott 


° __@ 7 (— pale 
I dy w tanh py = 5 In (1 — B?) + 4 sin (2) sinh 2¢ 


‘ hee a (1 = Bp? a) se — cos (ay 


Hence we find 


= 2AE 
Or= aa 20 coth 20)(1 +I AE) + otanh 9 +( — g tanh @) 


coth*g\ 1 1, 1-8 
«(1 3 )— 5+ sn Fo coin 29 mca — p 
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B* sin? (6/2) Q 1 Le? 
1 — BP sin? (6/2) sinh 2p * 2°92 Fay 


1 1 In(1+ 82) In(1 — Bé) 
a | Nee NP) 2 
: pe é (ie? =a cos? (0/2) me I 7 BE 1 ne BE i u se 


__ Bsin(6/2)[1 — sin (0/2)] 
i a—€ CA 


(7-92) 


Numerical investigation reveals that these corrections are far from negligible as they increase with 
energy. When AE/m -— 0 the previous formulas become invalid. A criterion for validity is 


x om 
—-In—<1 
™ 


To go beyond this result, it is necessary to include higher-order terms involving 
the emission of several or even an infinite number of photons. The contributions 
of soft photons is then known to factorize in exponential form, showing that the 
process has zero probability in the limit AE/m — 0. 


f-3 NEW°EFFECTS 


Higher-order terms in the perturbation series induce new effects, some of which 
will now be briefly presented. 


7-3-1 Photon-Photon Scattering 


Four photon interactions have no classical counterpart and arise through quantum 
fluctuations of virtual charged particle pairs. From a theoretical point of view it 
is of interest to demonstrate how gauge invariance and current conservation 
cure the remaining potentially divergent amplitude, as was anticipated in the 
discussion of Sec. 7-1. 
Let us first sketch some dimensional arguments. The basic amplitude (Fig. 7-15) 
is of order « so that the cross section takes the form 
3 3 4 v2 
pa |e Oe IMT pe, aks (7-93) 
3 M4 k, 5 k3 


with obvious notations for the momenta and energies of the photons. The dimen- 


Figure 7-15 Basic diagram for photon-photon scattering. 
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sional parameters are, for instance, the common energy @ in the center of mass 
frame and the electron mass m. Here M stands for a dimensionless matrix element 
so that o behaves correctly as a surface. 

We expect gauge invariance to allow for the extraction of four powers of the 
momenta from M. This will be subsequently confirmed. Consequently, |M|* must 
behave at least as (w/m)® for small w. In this limit k, +k is proportional to wo", 
We therefore predict that at low energies 


6 
o ~ ax* (2) a oe] (7-94) 
mj} m m 

where a is a numerical constant. Since x*/m? ~ 10 ybarns, this cross section is 
exceedingly small up to the hundred-keV region. On the other hand, for w/m > 1, 
due to the convergence of the process we do not encounter mass singularities so 
that simple dimensional counting indicates that 


Pies: Damas (7-95) 
@ m 


with b a second numerical constant. In other words, it is likely that o peaks for 
typical values of w/m of order one. 

We shall not present a fully fledged calculation, but instead will show that 
the results of the last section in Chap. 4 allow us to compute the numerical 
constant a of the low-energy behavior. For that matter we observe that the Euler- 
Heisenberg lagrangian [Eq. (4-123)] involves a summation of all one-loop dia- 
grams, each one integrated over the same constant electromagnetic field as many 
times as there are external lines. Note that E and B are linear in the external 
momenta. Furthermore, at a given order in e the sum of all diagrams (cor- 
responding to permutations of the external line indices) is gauge invariant. When 
contracted with a polarization vector ¢,(k) for an on-shell process it should be 
invariant in the substitution ¢,(k) > ¢,(k) + Ak,. This means that this contracted 
quantity can only depend on e(k) through the combination ¢,(k)k, — é,(k)k,. Up 
to a factor i this is the Fourier transform f,,(k) of the corresponding electro- 
magnetic field. The term proportional to e* will then involve at least four powers 
of the momenta. In the low-energy limit it will then be sufficient to study the 
coefficient of the combination | |; f$?,, at zero frequency. If the external electro- 
magnetic fields coincide, the corresponding quantity is the fourth-order term in 
the effective lagrangian computed in a constant field, which was given by Eq. 
(4-125) as 


2 2 P) ys 4 
iSL, = a [(e? — B?)? + 7(E: B)?] = ae (Fy a (=) | (7-96) 


To recover the scattering amplitude we have simply to replace f by the sum 
uf = 1 (2) 4 ¢@) 4 f and divide the coefficient of the multilinear combination 
[I]; #@ by 4! With the substitution f,, = i(k,e, — keé,) the required value will 
be obtained in the limit w/m > 0. 
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As a shorthand notation we identify f,° with a matrix, and the trace symbol 
runs over the four values of the Lorentz index. It is then easy to see that 
(f jetty) (ffl = 4trf* — rf?) 
From the above instructions the amplitude M is found to be 
 _—— sli a (S(tr FOF tr fOLO + tr fOFO tr OH 
45m” 12 
+ te fYFMtr fALA) — 7 tr(fOF@fOl 4 fAlM ‘APA 
$fOIFVFAPM4 4 fDPAPV FA 4 OPA PV FA 


+f MFOPOPA)) (7-97) 
The unpolarized cross section involves the average of the absolute square of M: 
[M|*=2) |M|? 


In the center of mass frame it reads 


—— Ss ee ia ke 
~ (ky kz) | 2@3(2n)? 2wa(2n)? oe 
1 Zz 
— ? 7-98 
640" (20) lle Ze) 


To perform the sum over polarizations we note that 
2 Phy? = —Gppk°k” + Keky Gg” — Gp k' ky — Gp7k pk”) 


A tedious calculation yields 
Ln oe 
dQ (2n)? 2w? (90)*m® 
x 139 [(ki ‘ k2)*(k3 ka)? (Kay k3)°(k2 k4)? + (ky° ka) (ka : k3)"] (7277) 
The second invariant combination which could have entered this expression, that 
 —— sacar (ky * ka )(k2+k3)(k3*ka)(ka*k,) is equal to half the quantity 
under brackets in Eq. (7-99). If 6 is the center of mass scattering angle we have 
ky *k» = kaka = 2a? 
ky *ky = kaka = w?(1 — Cos 0) 
ky "ks = kz ka w7(1 + cos 0) 
so that the unpolarized differential cross section is equal to 


6 
a = oF OOF (2) = (3 + c0s? 6)" a 
Tl 


m 
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From Bose statistics this expression is symmetrical in the exchange 6 > 7 — 6. 
To obtain the total elastic cross section it is therefore required to integrate only 
over half the unit sphere in solid angle, with the result 


{139° (5G; cor ol 
ees | —)| > 1 7-101 
ae, (90)? (*) ‘ (2) m? ee 


This result agrees with our previous estimate therefore producing the required 
coefficient a. 


As an exercise the reader may compute the coefficient b entering the high-energy behavior (7-95) 
of the cross section. The photon-photon induced interaction implies the possibility of a coherent 
scattering on charged targets when two of the lines in the diagram of Fig. 7-15 refer to Coulomb 
interactions with a nucleus. This can be related to pair creation in the nuclear field and may be 
compared with the Compton process. An alternative process worth investigating is the photon 
“splitting” amplitude when a photon hitting a target (— Ze) yields two photons. 


7-3-2 Lamb Shift 


The 1947 measurement by Lamb and Retherford of the shift between the hydrogen 
states 251). and 2P;). predicted to be degenerate according to the Dirac theory 
has been a memorable event stimulating the development of quantum field theory. 
Here we shall content ourselves with a simple account of the effect, leaving aside 
the more basic questions related to the formulations of a relativistic bound- 
state theory. In fact what Lamb and Retherford measured was the 2P3,. — 
2S,/2 transitions as a function of an applied magnetic field. In the limit of zero 
field the observed value was approximately 1000 MHz lower than what could be 
expected from the fine structure interval ma*/32 ~ 10,960 MHz. 

To analyze this discrepancy we shall use, following the original work of 
Bethe, a combination of Dirac’s theory and radiative corrections, as discussed 
earlier in this chapter. We have seen that the vacuum polarization, by modifying 
the effective potential, had the effect of increasing the binding of the 2S,,2 level by 
27 MHz relative to the 2P,;2 state. This cannot therefore be the main effect since 
the experimental observation requires the 2S,)2 level to lie well above the 2P;/2 
one. Of course, a full theory requires to take into account all effects of the same 
order and to include nuclear corrections (magnetic moment, recoil, form factors, 
polarizability, etc.). Moreover, all excited levels are metastable, meaning that each 
one has a natural line width. 

In Chap. 2 we recalled Welton’s argument implying the interaction of the 
bound electron with the fluctuating vacuum electric field and leading to an 
estimated shift with the correct sign and order of magnitude: 

4,5 
OE nt ~ os m = In Sy 01,0 (7-102) 
We now want to give a more quantitative evaluation. 
Our starting point will be the effective interaction (7-77) for electrons inter- 
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acting with an external field. This leads to a modified Dirac equation of the form 


i@—m— 2s m_331 oie p = 
E m ti5|1 = (in 3 Jol+ 2 ebearn|utx) =0 


(7-103) 


including vacuum polarization and vertex corrections evaluated for a slowly 
variable external field. The differential operators inside the curly brackets operate 
on the Coulomb potential A*(x) only. 

The quantity y is to be interpreted as a matrix element of the field operator 
between the vacuum and a one-electron state in the presence of the external 
potential. According to the perturbative rules, the added term can only be taken 
into account to first order. Otherwise our calculation would be logically incon- 
sistent. The unperturbed states are solutions of the Dirac equation 


Ze 
~ An]x| 


(id -m—eA)W(x)=0 A(x) = 


es (7-104) 


which have been discussed in Sec. 2-3. Consequently, the level shift is given by 
the expression 


dE = ex [ax w(x) i 


(i 7 _ : _ 5) At4°Cx + = ye Bia) bn 
(7-105) 


where w stands for a solution pertaining to the quantum numbers (n, |, j) of 
Eq. (7-104). 

As a first approximation we may even use the nonrelativistic value for w 
in Eq. (7-105) in such a way that 


A A(x) = Zed°(x) | WO) |? = -, a ay 


Tm 


This yields 


3 
3 (Za)* 


e | ax Wi ix \[AA°(x)] Wat, j(X) = ce 


Define now the separate contributions of the two terms in (7-105) as 
bE = 6E + 6E® (7-106) 
It follows that 


ea } moe | a . 
a= = [ax Wri) (i Tae :) [AAC] Vn iX) 


4ma BAe 
= 10 3 aa ee ay (in™ -3-3) 


(7-107) 
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The relativistic wave functions are singular at the origin. A more rigorous treat- 
ment would require avoiding an expansion of the effective interaction in powers of 
q?/m*. Since only mean values of the potential enter the expression (7-105), the 
above result remains accurate to order a(Za)’*. 
We turn now to the second term dE), which we intend to compute in the 
same spirit. Some limited use of the Dirac equation is necessary since y matrices 
relate small and large components. To this end we express dE},/; as 


SEQ) = | dx Whi) 7° EO ns (%) (7-108) 
oo 4mm 
which represents the effect of the anomalous magnetic moment inducing an electric 


dipole moment for a moving electron. The large (~) and small (vy) components 
of w are approximately related by 


yy ioe) 


2m 
ignoring the effects of the Coulomb potential. Since eE(x) = —Za(x/|x|*) and 
,o°X 1 o*xa°V 
1c a Se 
MYO px” pep em AG —— ‘) 


we find after an integration by parts that 


ax W"00 : * W(x) ~ x le x@ 6 Y, “lo 


As in the case of the hyperfine interaction, some care is required when handling 
such a singular quantity as x/|x|°. We recall that x/|x |? = — V1/|x|and Al/|x|= 
—4n6>(x). We can take for g a Pauli spinor and use the expression of the non- 
relativistic angular momentum operator L = (1/i)x x V together with the identity 


[o,Aq 0Bs] = 5ap(AgBy — ByAa) + ifaneO(AaBy + By Aa) 


to obtain 


SE?) ; = oe [as phil)  s%en +4 ag 13S oe 108) 


We know the value of the wave function at the origin. The matrix element of 


L-S is equal to (1 — 6,,0)[j + 1) — + 1) — #]/2. Finally, the mean value of 
1/|x|° for states of angular momenta equal or larger to one is 


! 2 4 
G = | a. = + Dale De (Zma) 
Putting everything together we find 


a(Za)* 1 
Inn? TEA 


6E®); a C jul 
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G+ D= M+ D—3 


Cin = 610 + (1 — 610 +1 


(7-109) 
ifj=1+4 


= ifj=1—4, I[>1 
For states with nonvanishing orbital momentum, dE“) vanishes, while 5E 
gives a small contribution of order «(Z«)* originating from the electron anomaly. 
For s states our calculation of dE still contains the fictitious mass p and 
diverges as 1 > 0, a signal that we have omitted some essential contribution. 
As the qualitative argument indicated, an effective infrared cutoff was expected 
of the order of the Bohr radius (Zma)~' instead of the arbitrary value p7?. 
The mistake can be traced to the use of the effective interaction (7-77) for 
arbitrary large wavelengths. For wavelengths comparable to or larger than the 
Bohr radius the radiative corrections should in turn take into account the Coulomb 
interaction, 1.e., the fact that they apply not to a free but to a bound electron. 
In order to correct for this effect we use the observation that the ratio of 
the binding energy to the rest mass energy is very small, of order Za. Let 
us introduce a cutoff K on the three-momentum of virtual photons such that 
mZ*a? «K «m. For k < K it will be possible to treat the electrons in a non- 
relativistic approximation and we shall be able to take the nuclear potential 
into account. For k > K we will neglect the effects of binding and use the previous 
results, except for the fact that we need an accurate relation between K and uy. 
To do this we return to Sec. 7-2-3, where we computed the rate of soft 
bremsstrahlung by integrating the quantity 


oa ual 2 m? 
© An? Jone 20 | ke pk-p’ (ke py (ke py’ 
The latter was evaluated for u «AE but p # 0. Let us repeat the calculation 
setting = 0 and using instead the lower cutoff K(K «AE) on photon three- 
momenta. Denote such a quantity as B,. The comparison of B with B, will offer 


a means of relating K and p. Keeping the definition of the hyperbolic angle @ 
such that p: p’ = m* cosh 29 we find 


AE 
Jape Of coth 2g — 1) In — 
1 K 


Our previous result for B was [Eq. (7-88)] 


2a 2AE 1+ 
Se —B 
1 = bs | 1 sae aa 
= 700 20 FS | om 0 BENE — cos" OQ)" 1 BE 
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For bound electrons the velocity is small (B ~ Za). It is therefore appropriate 
to match B, and B in the limiting case where fp +0, 9 ~ B sin (6/2). Under 
these circumstances the integral appearing in the expression for B is approximately 


equal to 
6 4 2 29 | 
ap sing [1+ 5p (1 +2.00s 5)+ 


We may then identify B and B, provided that 


im 5 
lt eens (7-110) 
The Lamb-shift calculation only involves one virtual photon line. Furthermore, 
the nucleus dictates a privileged frame for which the splitting k > K is legitimate. 
We decompose accordingly dE" into two parts, 6E* and dE~. For 6E~ we 
use the result (7-107) with the substitution implied by (7-110). The remaining 
piece, SE<, requires a new calculation. Our expression is more correctly 


6E = 5E< + 6E” 


4 4 5 (a Vane 
6E” = —ma cas, a 61,0 
n 8 § 


~ 32 


This will be meaningful if the dependence on K cancels between 6E~ and 6E~. 

To obtain 6E~ we return to the original procedure of radiative corrections 
with the proviso that the momenta occurring in the virtual radiation field are 
limited by K and hence can be treated by second-order nonrelativistic perturbation 
theory starting from a Schrodinger equation of the form 


ee ; 
(Ese yy — E (; v- eA) + e(A® + 49 Jy (7-112) 
Here wy is a wave function for both the electron and the radiation field (A}, 
A,) which describes in the unperturbed state a bound electron and the electro- 
magnetic vacuum. The quantum part effectively reduces to 


d°k 
DN = —— - (a) ik-x 
is im 2|k|(2n)° 2 2 ae | 


=1, 
A2(x) = 0 (7-113) 
We treat the interaction with the radiation field to lowest nonvanishing order 
and find two contributions. The first one is a “seagull” contribution arising 
from the quadratic term in A,. It may be reabsorbed in E as a contribution to 
mass renormalization, since it affects indiscriminately all levels. The second one 
has the form 
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d>k e 
OE; = See 
» . 2k(2n)° » (2m)? 


n’ 


x fax 2 UOMO? + 114 


En — Ey —k 


where E, and E, stand for the unperturbed levels, E, ~ m— mZ*a7/2n?. The 
range of the x integration is effectively limited by the Bohr radius. Accordingly, 
K|x| S K/mZ«. From our hypothesis on K this may be considered as very small, 
justifying the dipole approximation where ex js replaced by unity. Define 
Vop = (1/im)V and note that 


Y [<n|Vop 3 €,(k)|n’>|? a (s* _ sae 


Jens ><a oto 


a=1,2 k 
dQ kk 2 
d wie ab Sa 
= \2 (1 i? ) 3 on 
We therefore conclude that 
20 x dk k 
IS = — a ellie 2 - 
a | Ene, aK lori (7-115) 


Formula (7-115) is not yet the correct one since it does not take into account 
mass renormalization in the sense that we are using in v,, the expression of the 
physical mass. The contribution of very soft photons (k < K) to the self-mass has 
yet to be subtracted from it. In other words, a counterterm of the form 
—(6m/2m?)pop° = —(dm/2)vop? had to be inserted in the hamiltonian with 6m 
adjusted in such a way that 6E~ vanishes for a free electron. Since <n|vo,"|n> = 
Yn |<n|Vop|n’>|* and, owing to the fact that K >|E, — E,’|, it is clear that the 
correct expression for 6E~ is 


epee 
ses =| 


* dk k eae | <n| Vop|n’>|? 
0 E,—E,y—k k ce 


K 


[Sa 


= = 3 (Ey — E,)\n | <n|Vop|n’>|? (7-116) 


At that stage we can only do a numerical evaluation. Bethe, who was the first 
to determine this nonrelativistic contribution, has introduced the following log- 
arithmic average. For a state |n> pertaining to an s wave define <E,,> through 


Dae | <n|Vop| n> |? (Ew = E,) 
using an arbitrary, but fixed, energy scale. This definition becomes meaningless 


for higher waves (/ + 0) where the denominator vanishes, as we shall soon realize. 
Therefore, in this case 6E~ becomes independent of K, a fortunate circumstance. 


ince = 1=0 (7-117) 
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For this case we define, nevertheless, a quantity <E,> through 


ee ee : I 
OE; = pe | <n| Vop|'>|? (En — En) In ears | 
_ 4a (Za)* | Z? Ryd 
140 (7-118) 
ae Ms) Zi 
mo 
a’ 


For s waves, using Eq. (7-117) we have 
2a | K . 
bEw = 37 114M | Vop|n'>|*(Ew — £,) n> <E,> l=0 


Let us transform this expression. If Ho is the Schrodinger hamiltonian including 
the Coulomb potential we may write 


ys (Ey — E,)| <n = <allivers Ho] "Vop|M> al Gnas [Ho, Yop] |"> 


Z z 
ieee 


1 
= sis [as (a +): V[U* (x) (x)] = £7 4 0 oe 


2m(Za)* 5 


—— Sage (1-0) 
n> 


It follows then that for s waves 
<_4a(Za)* 
ob i ae ee 


We now combine all the pieces: 6E~ given by (7-118) and (7-119), dE” given by 
(7-111), and 5E® by (7-109) (which also contributes for | = 0). The arbitrary 
quantities u and K disappear and the Lamb shift is found to be 


19 


0 (7-119) 


m 
 eeneleteaeeet meen for 1=0 
sp. = at Za 2<Eno> 30 a “Ae 
nl = an n> AZ Ryd 5 rey , ( = 
fe, for 140 
= pe 8 2)-ea 


with C,,; given in (7-109). 


The main contribution was computed by Bethe while the «(Za)* term was obtained by Kroll and 
Lamb, and by French and Weisskopf. 
For atomic hydrogen using the numerical values 


<Exs>) = 16.640 Ryd = <Ep> = 0.9704 Ryd (7-121) 
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the value for the 2S,). — 2P,)2 splitting is predicted by formula (7-120) to be 


“(in eee! | _ en 
Pa TR oo) (7-122) 


which compares favorably with the 1953 measurement of Triebwasser, Dayhoff, and Lamb of 1 057.8 + 
0.1 MHz. 

To improve the calculation it is necessary to develop a more elaborate theoretical framework. 
Further corrections involve a refinement on the electron propagation in the Coulomb field (higher 
powers and logarithms in Za) and require a treatment of higher-order radiative corrections. Terms 
implying the nuclear recoil [in (m/M)(Za)* and (m/M)\(Za)° | and effects of a finite nuclear radius 
Ry [in (Rvm)*(Z)*] have also to be included to obtain the latest theoretical values of Erickson (1971): 


1 057.916 + 0.010 MHz 


Fas, ea E2P,, a 


or Mohr (1975): 
1 057.864 + 0.014 MHz 


where the main uncertainty arises from higher orders in the binding (Za) in the second-order (a) 
electron self-energy. The latest experimental values are due to Lundeen and Pipkin (1975): 


1 057.893 + 0.020 MHz 
and Andrews and Newton (1976): 
1 057.862 + 0.020 MHz 


7-3-3 Van der Waals Forces at Large Distances 


We return to the question of the large distance potential between polarizable neutral systems intro- 
duced in Chap. 3. Our derivation is modeled after the work of Feinberg and Sucher. 

We have seen that a phenomenological hamiltonian density could account for the interaction 
energy ofa polarizable neutral particle described by the scalar hermitian field g with a slowly varying 
field. It reads 


Hn = 910.90, 9F” F*, + go? F? 


= Or + Op 
n= = (7-123) 
m 
7 = ns ap 


The couplings g, and g, are related to the electric (xg) and magnetic (ag) susceptibilities in such a 
way that a particle at rest would contribute an interaction energy of the form 
E- B? 


lot = — 05 — (7-124) 


6 


The type of system we have in mind may be, for instance, an atom whose interna] structure is dis- 
regarded except for the parameters x, and as. Note that these quantities have the dimension of a 
volume. To define a static interaction potential between such systems we proceed as follows. Assume 
os and wg small enough to identify the Born term of a scattering amplitude in the low-energy regime 
with the Fourier transform of the potential according to Fermi’s golden rule 


ee 


dou = — =| —iV(q "5 = 2n6(Es — E;) (7-125) 


Here v,, stands for the relative velocity of particles a and b, q is the transfer momentum, and V(q) 
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is such that 


| 24 pq) eit 7-126) 
V(r) = (as Viqves ( 


Let F be the fully relativistic scattering amplitude arising to lowest order from the interaction (7-123) 
(see Fig. 7-16). Let us study the threshold behavior of the relativistic cross section, ie., the limit 
(Pa + Po)? > (ma + my)?: 


1 d°pi, d° py 


FS ne ee ei 0 SSS 2: 4 54. a eas = 
= apo? mina? | nO) On) QED C7 ° Pat Pb— Pa — Ped 


dome 


In this limit p,+p, ~ mam, + (p?/2)[(m. + ms)?/mam,] with p = |p| equal to the magnitude of the com- 
mon three-momentum of particles a and b in the center of mass frame. Therefore, 


[(Pa* Po)? — m2mpz]*!? ~ (ma + my)p = mam, | — — aE) = ples 

mM, Mp 

and i.) | Se (7-127) 
n o—- — — 21 _ - 
Vap | 4mam, | (27)? ee 
This means that close to threshold 
An Re 
-iV(q) = (7-128) 
4m,m, 


where —q? can be identified with the square relativistic momentum transfer q? = (p, — p,)’, and the 
center of mass energy square s = (p, + py)” is taken equal to (m, + m,)°. 

From (7-123) we can compute the amplitude F to order g? as represented in Fig. 7-16. This is 
again a one-loop diagram with a strong ultraviolet divergence. A theory based on (7-123) is not 
renormalizable due to the dimensionality of the coupling constants. However, we agreed to use it 
only as a phenomenological description and as such to any order we may perform a finite number of 
ultraviolet subtractions without modifying the long-range behavior of V which we intend to find. 


Therefore for our purpose we are safe. The elementary contraction needed to apply perturbation 
theory is 


d*k e ike (xy) 


€O| TFyv(x)Fpo(y)|0> = —i (Qn)* k?+ie 


Kyv, pak) 


(7-129) 
J Sprepyicl Use) = heed ve a Kyk pF ne — kukeQvp SF KykaGup 


Before any subtraction the Feynman rules provide us with an amplitude F of the form 
eo | d*k d*k' 54(k +k’ — q) 
(2n)*(k? + ie)(k'? + ie) 
d(k, k’) = 893.93 K ws, pak) K*""(k’) 
+ [4gig5(pép's + pop's)K™7(k) K*, pk’) + (a+>b)] ee 
+ gigi[(sp’s + Pop's) K°"*” (K)K*,* (k)(pe-p'p + ppp’) + (keok)] 


d(k, k’) 


A convenient way to perform the ultraviolet subtractions without modifying the long-range behavior 
follows from the observation that for fixed s = (p, + ps)’, F is analytic (and purely imaginary) along 
the semiaxis q” < 0. Its cut along the real axis arises from its real part and we are only interested 
in the vicinity of q? = 0. We therefore write 


vie Fx(m’) 
ee dm? ——.—— + AF (q* Le 
I 7 ee + AF (q°) (7-131) 
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Dy oy 


pt Figure 7-16 Interaction between neutral systems arising from 


Py, ; ‘ ee 
b electric and magnetic susceptibilities. 


with A¥ regular in the vicinity of q? = 0 and M? positive and arbitrary. Consequently, from (7-126) 
and (7-128) the asymptotic part of the potential will be given by 


1 M? 
V(r) ean | dm? Fp(m?) e—™ 


ag 2 
l6x*m,mpyr Jo 


1 ° M22 y? 
| dy” Fx(%a)e (7-132) 
r 


i; 2 3 
16x*mam,r> Jo 


We shall soon see that Fa(q*) behaves as — 4(q?)* for q? 0 so that the dominant part of V will 
be obtained by taking the limit Mr > oo in (7-132), that is, 


—A aa 5!A 
V. =a es ee d 4 3 4 _-x Se SS Ts 
as() 1622m,m,r7 | sent 8x2m,m,r" 22) 
It remains to compute 4. To do so we observe that each denominator in (7-130) may be split into 
two terms according to | (k* + ie) = PP k* — ind(k?) where PP means principal part. This we 
abbreviate as P — ixd. Up to the polynomial d the integrand in (7-130) takes the form 
(P — ind)(P’ ~ ind’) = PP’ — 27665’ — in(5P’ + 6'P) 
This is, of course, nothing but the Fourier transform of [G(x — y)]* in configuration space with 
Gr the Feynman propagator. The effect of d is then to act as a polynomial of derivatives on this 
quantity. Now if instead of using G? we were to use the product Gagy(x — y)Gye(x — y) with at most 
a support at x = y we would obtain by a Fourier transformation an ill-defined polynomial of the 


momenta without contribution to the real part of 7. Consequently, as far as we are concerned, the 
replacement 


1 1 1 1 
K+ ie RK? Fie k? + (kk Ye K? — (K/|K De 


would lead to zero. This means that we may subtract to the previous contribution the combination 
(P — ined)(P’ + ime’d') = PP’ + 1:786'56' + in(Pe'd' — P’e0) 
with €6 standing for (k°/|k°|) 6(k’). The result is the quantity 
(PP’ — 1756’) — (PP’ + n7ee'56') = —17(1 + e8')d6' 


Note that 1 + ce’ = 2 if k° and k’® are of the same sign and zero otherwise, and that the 6 functions 
project the photons on-shell. For q? > 0 the conservation of energy momentum k + k’ = q implies 
that we may keep only one of the two possibilities k° > 0, k’° > 0, for instance. We have now 
1 a®k aek’ 
Fp —— | —— —____ (2n)* 5*(k + k’ — gq) d(k,k’)|,._,, 7-134 
2 | ao 2k'°(2n)? ( ) ( q) ( Me =k'2?=0 ( ) 
This lengthy derivation is nothing but an application of the Cutkosky rules (Chap. 6) to this special 
case. Of course, #p as given by Eq. (7-134) is a convergent quantity and we simply want its leading 
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behavior near y* = 0. We first evaluate d(k, k’) given by (7-130). The explicit expression is slightly 
cumbersome: 


d(k, k)| 2 pag = O49202(K-k'P + 16g91g3(p** p(k k’Y 
+ 169% g3(p>= p>Mk-k)? + 4gt gi [(p*: pp: p*)(k- KY 
— (p= p’)(p'2* yp’? kkk’) — (p= pp? KY(p? * Wik k) 
— (p'> pp? k’)(p>> kkk’) — (p'** p'p* Kp?» K(k Ke) 
+ (p'= p°)(p* p*\(k- k’)? — (p'* p’(p** kp? * kkk’) 
= (p= ppt kip? kkk) — (ps pop? K)(p? = kylk +’) 
— (p?- p*\(p’?+ kp? = kk > kK) + (p** p')(p" = K)(p? * K(k k’) 
+ (pt p'\(p? > k’)(p® = kk +k) + (p> p’*(p* Wp? K(k Kk’) 
+ (p? + p®\(p*- k’\(p'?> K(k’) + 2(p"* K)(p'** KY? * kK)” * ke) 
+ 2(p*+ k)(p’*- k\(p’ > k’)(p"” * k’)]} (7-135) 


We have 2k-k’ = q? and we look for the coefficient of (q*)’ after integration. Therefore the scalar 
products multiplying (q?)? of the type p*: p” or p*: p’’,..., may be replaced by m,mp, a We set 


if @k dk’ 8(4?) 
9,2) | So Stk ek =i (7-136) 
a 5 | aoe Kemp em 2 9) lex 


where <d> is a function of q” alone close to the threshold and reads 


[ene aree) d*(k +k —q)d 
<d> = (7-137) 
[@mneareik) d(k +k’ —4q) 


We need 
Ckuk> =$ Qudv + 229 uvq7 (7-138) 


obtained by remarking that the trace is q7/2, and 


CKukvkakp> = 44u4v4a9e + b(Quqvgap + 4a969 uv)" 
a 4? (qudaG vp + 4v4aGup + Qu9b9va + F989 ua) 
+ (q°)? Guvgap + €(97)’ (GuaIvp + GreI up) CEL) 


The form (7-139) is obtained by neglecting all terms behaving as higher powers of g? and using 
the obvious symmetries in the exchanges (4 v), (2 f), (u, vera, f). To get the five coefficients 
a, b, c, d, and e we use the following five conditions: 


(a) Taking the partial trace over (1, v) we must find zero (mass shell condition) implying 
a+4b+4c=0 b+ 4d+2e=0 
(b) Taking the partial trace over (v, «) we obtain q?/2 <k’,kg> already computed; therefore, 
a+2b+6c=7 ctd+5e= 
(c) Finally, contracting with q"q"q%q* yields (q7/2)*; thus 
at+2b+4c+d+2e=7 
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Solving this system we obtain the coefficients needed in the sequel 


1 
(aa 
15 x 16 


(7-140) 


At threshold we have 


eel \ qe (TV saccxeee (TY a2 a 
Ure | + 169tg> >} ma + l6gtg$ ease + 4g%g% | 2m2m2 a 


2 


q i} a ‘ad s pL , , 
— 5 Ue Pa Pa PP) + gp’ pd) + (Ps ML2Aa PG: PY) + 4°(0" PY) 


12 
+ (p'*+ p°)[2(q° p"\(q° py”) + q7(p*- pp”) + (p** p”)[2 (a p'"(q- ”) + 47(p* Dy} 
qr a ‘a UG 
a {(p** p")[2(q* p°)(q- pp’) + q?(p’ p)] + (p”* p”)[2(4- P*V(q* B'’) + 47 (p"* P'))} 
+ 4¢(p*-k’)(p'*-k’)(p?- K\(p?- ®) (7-141) 


But q° p*, q°p’", gp”, q° p” are all proportional to g?. For instance, 


¥ a ¥ p* + jae p* = mye q 
e =, = + = 
q:p° = (p ”( 5 5 ) 5 


Therefore when extracting the coefficient of (g7)? all terms involving as coefficient scalar products of 
q with an external momentum are negligible. In particular, the contribution of the last mean value 
reduces to 

mam (q?)? 


<(p* Kp" k’)(p?- k\(p” > k)> = mamg(q’y’ (d + 2e) = 80 


The coefficient A defined as Fz = — A(q*)* + -:: is thus equal to 


3 
= én Ez + 4(m29%.g5 + mig’ g3) + iE mimbotot | 
mam 1 ; " 
= ix oa — 5 (a8 + orb) a — 5 (ak + ob) a + a (bt ot) (ob + | 7-142) 


Returning to Eq. (7-133) we obtain the asymptotic behavior of the Van der Waals force as 


” [23(ockock + aor) — T(akae + ox) ] (7-143) 


Vas(r) = (4x)3r7 


Let us close this section with a brief comment on the relation between the relativistic potential in 
1/r’ and the usual nonrelativistic 1/r° law, disregarding the effects of retardation. In a nonrelativistic 
calculation we only include the instantaneous Coulomb forces. To simplify, let us consider two 
hydrogen atoms at distance |, — Fp| with electrons located at r, +41, and r,+ 12. Neglecting 
altogether spin and the Pauli principle, the interaction potential may be written as 


a 1 1 i 1 ) 
V=— $$$ 5s eee”) @©h3 DT 
4n\jr—r| |retn—m—m| jt —%| |r, + 82 — Fal 


At large separations, neglecting angular dependences, this is qualitatively of the form (e7/4xr3,)rir2- 
However, due to this angular dependence the first effect of this perturbation vanishes (we are using 
an adiabatic approximation). To second order we obtain a negative contribution of the type 


1 


peas oa 
itte)° 


1 
<e7 12>, e713 >» KE 
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where AE is a characteristic atomic excitation energy. Similarly, the (electric) susceptibility a for each 
atom is obtained by treating a perturbation eE~r to second order, with the result that 
Cs a 

AE 


g~m~ 


Altogether we find 
NES, Lp 


(rav)° 


v™ eet 


We therefore expect that an interpolating formula between the r ° and r~ ? laws involves a charac- 
teristic retardation time t proportional to h/AE. The transition between the two regimes would occur 
at distances of order ct = hc/AE, where AE could be taken as a fraction of the ionization energy. 
For hydrogen this estimate would be of order 10? times the Bohr radius. In the practical theory of 
liquids this implies that we are always justified to work in the framework of the nonrelativistic 
approximation. 


NOTES 


Radiative corrections are treated in all textbooks and many of the original 
articles are reproduced in Schwinger’s “Quantum Electrodynamics,” Dover, New 
York, 1958, already quoted. Let us give a short list. 

Subjects covered are as follows: gauge-invariant regularization in W. Pauli 
and F. Villars, Rev. Mod. Phys., vol. 21, p. 434, 1949; vacuum polarization, limit 
k? +0, in E. A. Uehling, Phys. Rev., vol. 48, p. 55, 1935; complete treatment in 
J. Schwinger, Phys. Rev., vol. 75, p. 651, 1949; Ward identities in J. C. Ward, 
Phys. Rev., vol. 78, p. 182, 1950, and Y. Takahashi, Nuovo Cim., vol. 6, p. 371, 
1957; electron anomaly to order « in J. Schwinger, Phys. Rev., vol. 73, p. 416, 
1948 ; and electron form factors to order « in R. P. Feynman, Phys. Rev., vol. 76, 
p. 769, 1949. Work in the 1960s on higher-order corrections is reviewed in the 
report of B. Lautrup, A. Peterman, and E. de Rafael, Physics Reports, vol. 3C, 
p. 193, 1972. The most recent theoretical value quoted in the text comes from 
the work of P. Cvitanovic and T. Kinoshita, Phys. Rev., ser. D, vol. 10, p. 4007, 
1974. A large number of theorists have also contributed to this effort. The 
astonishingly accurate experimental value is due to R. S. Van Dyck, P. B. 
Schwinberg, and H. G. Dehmelt, Phys. Rev. Lett., vol. 38, p. 310, 1977. The 
experimental value of the muon anomaly has been taken from the work of 
J. Bailey, K. Borer, F. Combley, H. Drumm, F. Farley, J. Field, W. Flegel, 
P. Hattersley, F. Krienen, F. Lange, E. Picasso, and W. Von Riiden, Phys. Lett., 
vol. 68B, p. 191, 1977, and the theoretical prediction from the review of J. Calmet, 
S. Narison, M. Perrottet, and E. de Rafael, Rev. Mod. Phys., vol. 49, p. 21, 
1977. The original treatment of radiative corrections to Coulomb scattering is 
due to J. Schwinger, Phys. Rev., vol. 75, p. 1912, 1949. We follow in part the 
presentation in the book by A. I. Akhiezer and V. B. Berestetskii, “Quantum 
Electrodynamics,” Interscience Publishers, New York, 1965, which is extremely 
rich in detailed calculations. For infrared divergences we remind the reader of 
the work of D. Yennie, S. Frautschi, and H. Suura quoted in Chap. 4. As for 
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the Lamb shift the general aspects of hydrogenic atomic systems are covered in 
H. A. Bethe and E. Salpeter, “Quantum Mechanics of One and Two Electron 
Atoms,” Springer-Verlag, Berlin, 1957. The crucial experimental evidence comes 
from the experiment of W. E. Lamb and R. C. Retherford, Phys. Rev., vol. 72, 
p. 241, 1947. A more refined measurement in the 1950s is due to S. Triebwasser, 
E. S. Dayhoff, and W. E. Lamb, Phys. Rev., vol. 89, p. 98, 1953. 

The recent values quoted in the text are from S. R. Lundeen and F. M. Pipkin, 
Phys. Rev. Lett., vol. 34, p. 1368, 1975, and D. A. Andrews and G. Newton, 
Phys. Rev. Lett., vol. 37, p. 1254, 1976. The early theoretical contributions are 
those of H. A. Bethe, Phys. Rev., vol. 72, p. 339, 1947, for the nonrelativistic 
part, and of N. M. Kroll and W. E. Lamb, Phys. Rev., vol. 75, p. 388, 1949, 
and J. B. French and V. F. Weisskopf, Phys. Rev., vol. 75, p. 1240, 1949, for the 
relativistic one. For a consistent theory of the effect to all orders see, for instance, 
G. W. Erickson and D. R. Yennie, Annals of Physics, vol. 35, pp. 271 and 447, 
1965. The theoretical prediction has been taken from the work of G. W. Erickson, 
Phys. Rev. Lett., vol. 27, p. 780, 1971, and P. J. Mohr, Phys. Rev. Lett., vol. 34, 
p. 1050, 1975. 

For a recent estimate of the accuracy achieved in these higher-order com- 
putations one may consult the review by T. Kinoshita in “Perspective of Quantum 
Physics,” published by Iwanami Shoten, Tokyo. 

Needless to say, the previous short list of references does not do any justice 
to the enormous and beautiful experimental and theoretical work in quantum 
electrodynamics. 


CHAPTER 


EIGHT 
RENORMALIZATION 


Renormalization to all orders is at the heart of field theory. After an introduction 
devoted to a study of various regularization methods we present the Bogoliubov- 
Zimmermann subtraction scheme. We give indications on the convergence proof 
of renormalized integrals, and study ultraviolet behavior (Weinberg’s theorem) 
and massless theories. Renormalization of composite operators is briefly discussed. 
The interplay between gauge invariance and renormalization is dealt with in the 
last part of the chapter. 


8-1 REGULARIZATION AND POWER COUNTING 


8-1-1 Introduction 


This chapter is devoted to a systematic study of the renormalization procedure. 
The underlying philosophy has already been presented in the preceding chapter, 
in the case of quantum electrodynamics. We have seen that divergences may be 
absorbed in a redefinition of the various parameters, mass, coupling constant, 
etc., of the theory. It is convenient to get rid of the difficulties specific to electro- 
dynamics, namely, gauge invariance, and to study first the renormalization of a 
scalar theory. The new problems arising from the existence of symmetries will 
be considered at the end of this chapter for quantum electrodynamics, and in 
Chaps. 11 and 12 for other internal symmetries. 


372 


RENORMALIZATION 373 


Figure 8-1 A divergent diagram and the 


“ 2 associated counterterm. 


(a) (b) 


The nature and properties of renormalization were first formulated and 
studied by the founding fathers of quantum field theory, Tomonaga, Feynman, 
Dyson, Schwinger, etc. Important contributions were then made by Salam, Wein- 
berg, Bogoliubov and Parasiuk, and Hepp. More recently, Zimmermann and his 
followers have further clarified the systematics of the renormalization operation, 
while Epstein and Glaser presented an axiomatic approach. 

The mathematical nature of the problem is clear. Divergences occur in per- 
turbative computations because of lack of care in the multiplication of distribu- 
tions. For instance, the amputated self-energy diagram of Fig. 8-la is the ill- 
defined expression [Gr(y, — y2)]’. As it stands, this expression does not make 
sense; In momentum space, its Fourier transform is logarithmically divergent. 
As we saw in Chap. 7, such divergent expressions must be subtracted. Since 
we demand that the subtractions be local in configuration space, this operation 
amounts to a redefinition of the parameters of the lagrangian by an infinite 
amount. Therefore, we abandon the idea of using or observing the parameters 
of the initial lagrangian, the so-called “bare” quantities, and reexpress everything 
in terms of the finite “renormalized” and observable parameters. 

In the previous instance, we replace G#(y; — y2) by 


II(y1 — y2) = [Gr(y1 — y2)]? — S(y1 — 2) 


where S is a distribution concentrated at the origin, chosen in such a way as to 
make the whole expression IT meaningful. Here a term proportional to a 6 func- 
tion suffices. In Fourier representation 


1 ia. d*k i 
[Gr(y1 — y2)]? = |e eiP (i-¥) | 


(27)* (2n)* (k? — m? + ie)[(p — k)? — m? + ie] 


is replaced, for instance, by the well-defined expression 
d*D pip-(-y2) d*k 
py les ee, par if 2 eee 
IT(y 1 y2) aE |e (27)* 


1 1 | 
‘ r —m? + ie)[(p — k)? — m? + ie] (k? — m* + ie)? 
and we may write formally 
T(y1 — y2)= [Gr(ys — ya)? + Ad(y1 — ya) (8-1) 


Strictly speaking, the constant A is infinite, but formally the second term in (8-1) 
may be regarded as coming from the zeroth-order counterterm depicted in Fig. 
8-1b. By imposing a definite normalization condition on Tl, we may fix un- 
ambiguously the finite part of A. Let us emphasize that this renormalization 
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procedure works because of the locality and reality of the subtraction and hence 
of the counterterm. 

This subtraction operation may be presented in a systematic way, as we shall 
see in the following. If only a finite number of types of additional terms is required 
in the lagrangian to eliminate the divergences of Green functions, the renor- 
malized theory will depend only on a finite number of parameters. Such a 
theory is called renormalizable or super-renormalizable. It remains to prove that 
the renormalized integrals are indeed finite and satisfy the constraints of locality 
and unitarity. We shall evoke briefly these points. 

A slightly different approach, proposed by Dyson and Schwinger, relies on a 
set of integral equations between proper Green functions (Sec. 10-1). It also un- 
fortunately involves infinite multiplicative renormalization constants at inter- 
mediate stages. 

Finally, the most orthodox procedure of Epstein and Glaser relies directly 
on the axioms of local field theory in configuration space. It is free of mathe- 
matically undefined quantities but hides the multiplicative structure of renormal- 
ization. The latter will lead under proper interpretation to the study of the 
renormalization group (Chap. 13). 


8-1-2 Regularization 


In order to give a meaning to otherwise formal and divergent expressions, it 1s 
important as a preliminary step to regularize the perturbative expansion. After 
performing the renormalization subtractions, we shall be free to remove this 
regularization. Several procedures have been devised. The final results are finite 
and independent of this choice. 

The most naive prescription would be to perform a Wick rotation and to 
cut off the large values of the (euclidean) four-momenta of each loop. Every 
such integration would be restricted to a compact sphere (k’)!? < A. This cer- 
tainly makes any Feynman amplitude finite, but it ruins our beloved Poincaré 
(or rather euclidean) invariance. Therefore it is used only exceptionally, in heuristic 
arguments. A variant consists in discretizing space-time, i.e., in assuming that 
configuration variables x, take only discrete values corresponding, say, to the 
vertices of a regular lattice. Clearly, cutting off the small distances is equivalent 
to cutting off the large momenta. Here, too, rotational invariance is lost. 

A covariant regularization is obtained by substituting to the Feynman propa- 
gator Gr(x — y, m) an expression of the form 


GHe(x — y) = Gr(x — y,m) + ¥ Cy Gr(x — y, My) (8-2) 
k 


where the coefficients C, are judiciously chosen, as functions of m and the M, 
so as to remove some of the singularities of Gp. Which degree of singularity is 
acceptable in order that every Feynman diagram of a given theory be finite, 
and therefore how many terms are needed in the sum (8-2), will be clear as soon 
as we have given a criterion of convergence. Here it suffices to say that any 
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given diagram may be made finite by such a substitution. This is the case of the 
self-energy contribution considered in the introduction, for instance, if we 
substitute 


Gr(x — y, m) > Gr(x — y,m) — Ge(x — y, M) 


or in momentum space 


1 1 1 


> —=— — 


k7—m?* k?—m? = k? — M? 


In the general case, if we denote collectively all the auxiliary masses M, by A, 
the initial propagator is recovered by letting the cutoff A go to infinity. 

Modifying the large-momentum behavior of the propagator may also be 
performed on its parametric representation. It then amounts to modify the small 
« integration region in the form 


li da eiatk? —m? + ie) =|" do NG)) eitk? —m? + ie) 
0 0 

where the function p,(x) vanishes with some of its derivatives at ~=0. We 
demand also that as the cutoff is removed, that is, A— 00, pa(a)— 1 for all 
a > 0. For example, we may take p,(«) = @(a — 1/A”) or p(a) = a*~! where A> 1 
as Acc; the latter case corresponds to a propagator (k? — m? + is)7* in 
momentum space. This type of regularization has been extensively studied by 
Speer. 

In some instances, it may be important to use a more subtle procedure, 
in order to maintain some invariance properties such as gauge invariance. The 
Pauli-Villars regularization, already encountered in the previous chapter, has this 
property. Every photon propagator is replaced by a sum of the form (8-2). On 
the other hand, only fermion propagators pertaining to internal closed fermion 
loops are modified. More precisely, to a closed loop with 2n vertices, we associate 


2n Ss 2n 
tr it ny Sela zp ays m| = >) C0} | ! Vu, SH Zp —Zp-1; Mo (8-3) 
=1 


s=1 S 


where Zo = Z2n. We shall reexamine this regularization in greater detail in 
Sec. 8-4-2. 

Divergent Feynman integrals would be ultraviolet convergent in a smaller 
space-time dimension. In the dimensional regularization of ’t Hooft and Veltman, 
the Feynman integrals are computed for an arbitrary integer space-time dimen- 
sion d. The result of the integration may then be analytically continued to 
arbitrary real or even complex values of d. In this regularization, ultraviolet 
divergences manifest themselves as poles at rational or integer values of d. We 
are eventually interested in letting d go to four, the physical space-time dimension, 
and therefore we focus on the simple or multiple poles at d = 4. This continuation 
may also be defined in the presence of Dirac y matrices, except for the case of the 
ys matrix which is quite particular to d = 4 (or in general to even space-time 
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dimensions). The virtue of this method is that it automatically preserves internal 
symmetries that do not involve the ys; matrix. Technically, all the manipulations 
that we perform to check Ward identities (see Chap. 7 or Sec. 8-4) such as shifts 
of integration variables, contractions of Lorentz indices, etc., are consistent with 
the regularization. 


The analytic continuation to d-dimensional space-time is most easily performed, after a Wick rotation 
to the euclidean region, on the parametric form derived in Chap. 6. Indeed, we may compute the 
amplitude of Eq. (6-94) in an arbitrary integer dimension d (we drop henceforth the caret notation 
for euclidean momenta): 


1 dtk if 
(22)'64(Y p)Ig(P) = | il 


t=1 (2n)¢ k> +m 


5 | 2nt64P, — Y euki) (8-4) 
tov t 


The method of Sec. 6-2-3 is easily seen to go through, and the result reads 


Io(P) = ie [os 


: I a [(4n)" Po(a)} 4? Ca 


with the same functions Qg¢ and Ye as those defined in Eqs. (6-86) and (6-87). The integration over 
the homogeneity parameter 4 of the « is convergent at A = 0 if J —dL/2= Ll —d/2)+V—1>0: 


© dh ae exp — A[Qc(P, a) + ) am? ] 
Ig(P) = a yran | do, ———_—_  —_= = 6(1 — )a) 
oe I i ie [(4n)-Po(a)]*” da 
dN fea [Qc(P, «) + ¥ amp ]*"?-7 
= ea) = 8-6 
r( 2 me I det! d i [(4n)' Po(a]? ‘ _ 
For instance, for p, n = 1, 
re { d*k 1 
oP) = | ony ftp — m+ may + mi? 


1 (n+ p-—4d/2) [? - - ii 
~ 4x? TQ) i dat a *(1 — a)?“ *[a(1 — a)p? + am} + (1 — a)m3]}*? (8-7) 


The whole dependence on d may be explicited by performing the « integration. For low enough d, 
the result is finite. When d — 4, ultraviolet divergences may arise from the Euler function in front of 
(8-6), or from the « integration, or from both. However, for one-loop diagrams such as (8-7), only 
T(I — dL/2) diverges. If I — 2L is a nonpositive integer, then 


aL (- 1) aan a) 
r( 7 *)~; Oba (4—an Ce 


and I¢(P) has a simple pole at d = 4. When internal divergences coming from the « integration 
are also present, this pole may become a multiple one (compare with the examples below). 

For completeness, we have to say how to deal with integrals involving Lorentz vectors and/or 
spinors. The former case does not offer any difficulty. Four-vectors are transformed into d-vectors, 
the integrations are performed, and the result continued to arbitrary dimensions. For instance, again 
in euclidean space, 


i ak ky pr T(n+p—d/2) 
Qn)? [ip — ky? + mi" + ma)? (4x)"? TMT) 


- 
: \ do a(1 ~ a)?" [a1 — op? + omy + (1 — a)m3]2-"P 
0 
(8-9a) 
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| ak kuky " 1 
(2x) [(p — kb? + m3] "(k? + m3)? (4n)*? Tn) T(p) 


1 
x {{ eae "(l= ay * [all — aperemt + (1 — aymigyiee inane Se r(r +p ae 7 
0 2 


i 
+ | da a"**(1 — a)?" '[a(l — a)p? + am? + (1 — amy" rppaP(n +p- 5) (8-9b) 


1) 


In this expression, 6,, is the unit tensor in d-dimensional space, satisfying 
eo a (8-10) 
i 


This condition is maintained in the continuation to noninteger dimensions. Such a prescription 
ensures the consistency of this continuation with algebraic manipulations such as contractions or 
shifts of integration variables. For instance, it is easy to check that 


(= kk, 8” — m3 - (& 1 
(2x [(p — k)? — m3] "(k? — m3)? (2) [(p — kK)? — m7) "(k? — m3)? =} 


Spinors require more care. First, in the original four-dimensional Feynman integrand, we dis- 
tinguish y matrices of fermion loops from those pertaining to fermion lines connected with external 
lines. The latter are reduced by means of projection operators in four dimensions. We therefore have 
to consider for each diagram a collection of form factors involving only y matrices pertaining to loops. 
They are supposed to satisfy the rules 


{Yur Yr} = — 25 yy (8-114) 

(remember that we have performed a Wick rotation and hence our y matrices are now antihermitian) 
tr (odd number of y matrices) = 0 (8-11b) 

tr l= f(d) (8-1ic) 


where f(d) is an arbitrary smooth function satisfying f(4) = 4, for instance, f(d) = 4 or f(d) = d. The 
form of f is irrelevant for the procedure of regularization. (This would not be irrelevant if we really 
wanted to define the theory in d dimensions.) From the rules (8-11), we may reconstruct the whole set 
of identities for contractions and traces of products of y matrices that we listed in the Appendix for 
d = 4in the Minkowskian case. For instance, 


¢ tr yuyv = —f(d) buy 
YuPn = —al 
YuPv¥n = (d — 2)7v 


We have not defined the d-dimensional analog of ys. This is because the usual definition of ys 
appeals to the existence of Eyvpo, the completely antisymmetric tensor which is specific to d = 4. We 
conclude that we cannot continue fermion loops containing an odd number of ys matrices. This 
innocent-looking limitation is the manifestation, in the framework of this dimensional regularization, 
of a serious problem, namely, the possible appearance of chiral anomalies (Chaps. 11 and 12). 

This set of prescriptions may look like a rule of thumb to a skeptical reader. Its consistency, 
though quite likely and checked in everyday computations, has to the best of our knowledge never 
been completely proved. Especially embarrassing is the case of massless theories. For instance, we 
encounter integrals of the form 

ig be 

(2x)* —(k?)" 
that do not depend on any scale. The analytic continuation of such an integral is ill defined since 
there is no dimension d where it is meaningful. It is either infrared or ultraviolet divergent according 
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to whether d + p — 2n < Oor > 0. We shall disregard these problems and adhere, whenever necessary, 
to the prescription that such massless tadpole integrals vanish in the dimensional regularization. 

In computations with this regularization, we must remember that in d dimensions some coupling 
constants may acquire a dimension. For instance, the electric charge, the coefficient of 


[ax Aylx) U(x)" W(x) 
in the dimensionless action (h = 1) has a dimension in a mass scale 
d-2 4-d 
— —_ _ 1 — ea 
[e] =d—(d —1) 5 5 


since [] = (d — 1)/2and[A] = (d — 2)/2. We therefore replace e by :'*~“’ *e’, where e’ is dimensionless 
and yp is either one of the masses of the problem or an arbitrary energy scale if all particles are mass- 
less. When expanding the result near d = 4, we will therefore generate logarithms of this scale. 

Whether in the neighborhood of d = 4 we also expand the factors (4n)~‘*? in Eq. (8-6) is a 
matter of taste, as is the definition of f(d) in (8-11c). The important point is that in computing 
counterterms in a gauge-invariant way, or in comparing diagrams to check Ward identities, we 
always consider classes of diagrams with the same total number of loops L, and hence the same 
power of 4z, and the same number of fermion loops, and hence the same power of f(d). A change of 
prescription will not affect the validity of Ward identities and will only modify counterterms by a 
finite amount. 

As an illustration consider the vacuum polarization in scalar electrodynamics. Feynman rules 
for this theory have been given in Chap. 6. After a Wick rotation (for an euclidean external momentum 
p) the two diagrams of Fig. 8-2 give the contributions 


dék 5 2025 d 
Te =. 292) ee ee (1-$ 2)d/2~1 
wm 26 lon Pam? Gn 5) m) 


r= 6 | d*k (2k + p),(2k + p), 
oo 


7 (2x)4 [(p + k)? + m?](k? + m?) 


e? 


1 
= Gn pont ( ~ 5) i do [a(1 — a)p? + m?]¢/?-! 


1 
as pap.t ( = all do (1 — 2a)? [a(1 — a)p? + mya] 
0 


According to Eq. (8-8), let us isolate the pole at d = 4 in the sum FP + F®: 


a me | g 
(ay 4243 (puPy — p?5,.) + regular terms (8-12) 


b 
re +r = 


We verify that (a) the divergent terms proportional to m? have cancelled in the sum, and (b) the 
tensor structure of the divergence, and hence of the counterterm, is transverse to p. Both results 
are in agreement with what was expected from gauge invariance. Computation of the finite part is 
left to the reader, 


Figure 8-2 One-loop vacuum polariza- 
ptk tion in scalar electrodynamics. 


RENORMALIZATION 379 


Other regularization schemes may be invented. The important poini is that the 
final renormalized result does not depend on the choice of regularization. Strictly 
speaking, finding a regularization and showing that it makes all diagrams finite 
is not sufficient. We should also prove that the structure of divergences is such 
that they may be removed by acceptable counterterms, i.e., by local and hermitian 
polynomials in the fields. Divergences of the form log p? log A2, for instance, or 
with a complex coefficient would be catastrophes. At the very end we shall, how- 
ever, proceed in a different way. We shall prove that sufficiently subtracted 
Feynman integrands lead to a finite theory. Since these subtractions do correspond 
to the introduction of acceptable counterterms, the result will follow a posteriori. 
In other words, no regularization will be involved in the forthcoming proof of 
finiteness. However, its outcome and meaning would be obscured without recourse 
to an implicit regularization, which is therefore a convenient and very common 
tool. 


8-1-3 Power Counting 


We have already used the concept of superficial degree of divergence whenever 
we have based arguments about the convergence of Feynman integrals on dimen- 
sional considerations. Here we develop this concept in a systematic way. 

In this section and the following, we shall only face the problem of ultraviolet 
divergences, and postpone to a further study the possible infrared troubles arising 
from the masslessness of some particles. To be specific, we assume for the time 
being that all fields are massive. 

A naive way to estimate whether a given Feynman diagram G is convergent 
is to dilate simultaneously its internal momenta by a common factor A, k; > Ak; 
and to look for its behavior Ig ~ 2° as A goes to infinity. In the parametric 
representation, this amounts to investigating the integrand when all the « tend to 
zero at the same rate «> A~*a,. We expect, and will soon prove, that if the 
overall power of A, called the superficial degree of divergence and denoted O, 
is nonnegative, w > 0, the integral is generally divergent. If it is negative, some 
subintegrations may still be divergent, and the integral is said to be superficially 
convergent. 

We consider a theory involving spin 0 or 1 boson fields and spin $ fermion 
fields. The fermion propagators behave for large momentum Ak as A~' and the 
boson ones as A” ?. Here we assume that massive vector fields have a propagator 
in the Stueckelberg gauge [Eq. (3-147)]. We shall reexamine in Chap. 12 what 
happens to massive fields coupled to nonconserved currents. The case of higher- 
spin fields will not be considered here. 

We use the notations of Chap. 6. A vertex v of the diagram G carries a 
power /* if the corresponding term in the interaction lagrangian involves Holy 
derivatives. Each integration d*q over the loop momenta contributes a power a. 
If L denotes the number of independent loops, Ig and I the number of internal 
boson and fermion lines respectively, and V the total number of vertices, the 
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superficial degree of divergence w of G reads 


HG) = ~4L+( Sy i,) = n= 2 (8-13a) 
vertices 
or o(G)-4=3Ilr+2Is+ > (6,-4) (8-13b) 
vertices 
owing the relation (6-69): 
L=IigtIrp+1—V (8-14) 


The quantity 6, counts the number of derivatives acting on the fields at vertex 
v contracted to give internal propagators. If f, (respectively b,) denotes the number 
of internal fermion (respectively boson) lines incident to the vertex r, we obviously 
have 


lr=40 f, and In=4¥b, (8-15) 


since every internal line is counted twice in the sum over the vertices. Thus 
Eq. (8-13b) may be rewritten 


~4=5(6,-4) (8-13c) 


where O, = 6, + $f, + b, (8-16) 


The interpretation of this index attached to the vertex v is clear if we remember 
the dimensional considerations of Sec. 6-2-1. A spin + fermion field is given the 
dimension # in mass scale and a boson field the dimension 1. Therefore w, is 
the contribution to the dimension of the interaction monomial of the f, internal 
fermions, b, internal bosons, and 6, derivatives on internal fields. Alternatively, 
if w, is the dimension of the monomial of ¥,,, attached to the vertex v, including 
external and internal fields, that is, 


= total number of boson fields + # times the total number of fermion fields 
+ total number of field derivatives (8-17) 


and if Ey and Eg stand for the number of external fermion and boson lines of 
the diagram, it is clear from (8-13c) that 
o(G)-4= ) (@,—4)-FEr— Es—6 (8-18) 
vertices 
where 6 is the total power of external momenta factorized from the Feynman 
integral. Ofcourse, the dimension w, of the vertex does not include the contribution 


of the dimensioned coupling constant pertaining to this vertex, g,. As already 
noticed in Chap. 6, 


@, + [g.] =4 (8-19) 


It is not surprising to see that the convergence of Feynman integrals has to 
do with the dimension of the coupling constants. Indeed, if w, < 4 for all vertices, 
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the coupling constant g, has a positive dimension. Going to higher and higher 
orders in perturbation theory yields more and more powers of g and forces the 
Feynman integrand to vanish faster and faster at large momenta in order to 
maintain the total dimension fixed. Conversely, ifall w, > 4, we expect the integrals 
to be more and more divergent. All these concepts apply to any subdiagram as 
well. In general, a diagram with «w(G) > 0 is called superficially divergent. 

We are now led to a classification of field theories into three classes: 


1. Nonrenormalizable theories are those containing at least an interaction mono- 
mial of degree w, > 4. For a given Green function Eq. (8-18) shows that the 
superficial degree of divergence grows with the number of vertices, i.e., with the 
order of perturbation theory. Any function becomes divergent at a high enough 
order, as will be demonstrated below. 

2. Renormalizable theories are the most interesting. All their interaction mono- 
mials have w, < 4, and at least one of them has w, = 4. If all monomials have 
@, = 4, we see from (8-18) that all diagrams contributing to a given function 
have the same degree of divergence. Only a finite number of Green’s functions 
gives rise to overall divergences. 

3. Super-renormalizable theories have only vertices with w, < 4. The degree of 
divergence decreases with the order of perturbation. Such theories have only a 
finite number of divergent diagrams. 


The word “nonrenormalizable” may be misleading. It does not mean that 
such theories cannot be made finite but rather that the proliferation of their 
divergences, and hence of counterterms, make them unrealistic in the framework 
of perturbation theory. After renormalization they will depend on an infinite set 
of arbitrary parameters barring any deeper principle allowing to relate them. 
We shall not consider them any more. On the other hand, super-renormalizable 
theories form a too restricted class and are often pathological. 

From the rule (8-17), it is easy to list by simple inspection all the possible 
renormalizable or super-renormalizable theories. We have to construct all possible 
interaction monomials that are Lorentz scalars, hermitian, and have a degree w, 
less or equal to 4, from derivatives, scalar fields g, Dirac fields w, and vector 
fields A, (supposed to be endowed with a Stueckelberg propagator). The mono- 
mials Ye or Wyse, —* or (A?), PAW (or perhaps WAys), 91,04", G'QA?, 
as well as monomials usually appearing in the kinetic lagrangian, Wéy, (d¢)’, 
(0,,A,)’, (0, A"), have w, = 4 and exhaust the list of renormalizable monomials, 
up to the introduction of several species of each field and possible internal sym- 
metries. The terms 9°, ww, Wysw, 7, A? have w, = 3 or 2, and thus lead to 
super-renormalizable lagrangians. Among nonrenormalizable > theories, let us quote 
the pseudovector coupling Wy,ysw0°@, the Fermi coupling Wy,(1 — ys)wwy(1 — 
ys)W, or higher-degree monomials in ~: @°, °°, etc. 

The previous analysis and list have been presented for a four-dimensional 
space-time. For purposes of generalization, or for the needs of statistical mechanics, 
it may be necessary to extend the analysis to different dimensions. The reader 
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will find no difficulty in deriving the analog of Eqs. (8-13) to (8-18) or in deter- 
mining in which dimension the Fermi theory or the scalar theories ¢*, 9°, 
(0,,~)° P(@), where P is an arbitrary polynomial, are renormalizable. 


8-1-4 Convergence Theorem 


Let us analyze more closely the relation between the convergence of Feynman 
integrals and power counting. We first note that as long as the propagators 
contain an imaginary part ie it is equivalent to study the convergence in 
minkowskian or in euclidean space; only the latter case will be considered 
here. 


The limit e— +0 has been studied by Bogoliubov and Parasiuk and by Hepp. They have shown 
the equivalence between the absolute convergence in the euclidean region defining an analytic function 
of the external momenta and the convergence of the corresponding Feynman integral toward a 
tempered (i.e., polynomially bounded) distribution in Minkowski space in the limit ¢ > +0. 


We use here a restrictive definition of a subdiagram g of a diagram G as a 
subset of vertices of G and of all the internal lines joining them in G. To each 
connected proper (i.e., one-particle irreducible) diagram, we associate the family 7 
of all its connected proper subdiagrams. Of course, ¥ contains G itself. 


Theorem Ifw(g) <0 for all g ¢ F, then the Feynman integral corresponding 
to G is absolutely convergent (in the euclidean region). 


To simplify the expressions, we only present the proof in the case of a scalar theory without 
derivative couplings, using the parametric representation. In the euclidean region, it reads 


5 = 2 
1a) = | dey da; oe er eed 


: (4n)*=P(a)? 20) 


We assume that m7 > 0 for all / and recall that Q is a quadratic, positive definite form in 
the external momenta and a homogeneous rational fraction of degree 1 in the x. Hence, only 
the convergence at « = 0 is questionable. The exponent being bounded at the origin plays no 
role. The polynomial # is a sum of monomials of degree L [compare with (6-86)]: 


Aa= Y [Jou (8-21) 


trees 7 1¢.7 
Following Hepp, we divide the integration domain into sectors 


0 << Op, S Om S60 Sy, 


where x is a permutation of (1, 2,...,). We shall prove the convergence of the integral sector 
by sector. To each sector corresponds a family of nested subsets y: of lines of G (not necessarily 
subdiagrams): 


Yi © ¥2 C7 Cpr =BG 


where y; contains the lines pertaining to (a,,,..., @,,). For notational simplicity we present the 
reasoning for the first sector corresponding to the identical permutation. In this sector, we 
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perform a change of variables 


a = BT B3 Bi 
%2= 3 Bi 
Pee neces ete nse sccceeesese (8-22) 
Oy—4 Bi- 1 BF 
ay = Bi 
the jacobian of which is 
D(ay,..., 4) 


—7!p p3... pzI-1 
Dib... Ba 2°B1B2°** Bt 


In these 8 variables the integration domain A reads 
0<frsm and O<f,<1 fori</<I-1 


We may define the superficial degree of divergence of each y,, even though », is not necessarily 
connected: 


@, = w(y:) = 2L; — 4h, (8-23) 
where J, = / is the number of internal lines of ;. The number of independent loops L, reads 
L,=1,+C,- V; (8-24) 


in terms of the number of connected parts C; and vertices V;. This generalization of formula 
(6-69) may be easily proved by induction. Of course, we have L; = 0, L; = L. As a function 
of the 8, ? is a polynomial, the distinct monomials of which have coefficients equal to unity. 
Let us prove that it is of the form 


P = Bi p3%--- BI" [1 + O(B)] (8-25) 
First, when all the « are simultaneously dilated by a factor p, 
P(pay,..., pas) = p*P lay,..., ot) 
Let us now investigate the behavior of 7 when only the « belonging to y, are dilated by a factor p: 
PAO rs. «es POO CT) (8-26) 


Equation (8-21) expresses F in terms of trees on G. Each tree 7 of G projects on y, along the 
union of C; > C, connected trees F,,...,F%¢,. This will contribute in (8-21) a term behaving 
under the dilatation (8-26) as a power of p equal to the number of lines of y, that do not 
belong to 


U=F,ITzI Te. 


Since the set Y satisfies an equation of the type (8-24) with a vanishing number of loops (it is 
a union of trees) and joins all vertices of y;, its total number of lines is 


Iy=VY,—C; 
Hence the power of p in the corresponding monomial of F is I, —Iy=!1-W+Ci2zl1-VYUit+ 
C, = L;. The lowest power of p is reached by the trees 7 of G such that C;=C,, that is, 
by these trees that project according to 


a 


c, 
=U 
Proj, Yi i 


All these trees 7 may be generated by constructing independent connected trees in each con- 
nected part of , and completing their union into a tree of G. Therefore, their total contribution 
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to P factorizes into Y, that is, the polynomial P attached to y, by the rule (8-21), times 
Po,,, where the reduced diagram G/y, is obtained by contracting all the lines and vertices of 
each connected part of y, into a unique vertex. We thus conclude that 


P( pets, .. +5 PO, Cr+ 1,--+, 01) = p'[P, (ar, ---, 0) Poyy, (ons 1p +s. Oe O(p)] (8-27) 


We now return to the variables f. Obviously 7 is the homogeneity factor of the parameters 
pertaining to y,; = G, B7 , that of y; 4,..., etc. Since the + are nested, Eq. (8-27) may be used 
repeatedly: 


P( Bi Bz --- BF, 63: (eae 7-1 Bi, Bi): 

= By eaBy B71. PRee Peis. 0B F151) 

= PPB [D,,_,(B2 °°: Bl-2, B+ * BF-2,---» BE-2, Poy, ,(1) + O(B7-1)] 

=e (8-28) 
with the result (8-25). The coefficient in front of the leading term is 1, since a unique monomial 
of the initial polynomial A may have this combination of powers of the £. We return to the 
integral (8-20) in the sector A. Under the assumption of the theorem, namely, that w(g) <0 


for any proper diagram g, it is easy to show that w(;) <0 for any subset 7, proper or not. 
Therefore the integrand is majorized up to a factor by 


Bi p3-:: pi I I 
ie GTS ie * = eee 
Bets pale... 4L, I ! 

1 P2 I I=1 1=1 


and since w, < 0, the integral le Ii df, B, “ * is absolutely convergent at the origin. We con- 
clude that the integral (8-20) isabsolutely convergent in every sector. QED 


For a scalar theory without derivative coupling, the foregoing proof has also 
shown that as soon as a proper subdiagram has a nonnegative superficial degree 
of divergence the Feynman integral is divergent. Indeed, there is a divergence 1n at 
least one sector and, the integral being positive definite, no cancellation may 
occur. On the other hand, we have encountered examples in electrodynamics 
of cancellations between different terms appearing in the numerator of the 
Feynman integrand in momentum space. For instance, we have shown that the 
vacuum polarization is only logarithmically rather than quadratically divergent 
and that light-by-light scattering is convergent. 

Later, we shall have to show that after subtraction of all subdiagrams such 
that w(g) > 0 the integral is absolutely convergent. Notice that the previous 
considerations on absolute convergence justify a posteriori the manipulations— 
interchange of orders of integrations or changes of integration variables—per- 
formed in the derivation of (8-20). As a bonus we also find how many terms 
have to be subtracted to the conventional propagator (8-2) in order to regu- 
larize the theory. For instance, in the y* theory, the replacement (k? — m2)7! > 
(k? — m*)~! — (k? — A*)~* makes every diagram finite but the one-loop tadpole, 
since the superficial degree of divergence now reads 


co(G) = 4L — 41 = 4(1 — V) 


which is negative for V> 1. The one-loop tadpole may be either regularized 
separately or discarded by the use of a Wick ordering prescription. In this way, 
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we end up with a finite regularized theory. Such an analysis may be carried out 
in the case of quantum electrodynamics (see Sec. 8-4-2). 

A useful corollary of the previous theorem is the following. If a diagram 
G has no superficially divergent subdiagram, w(g) < 0 for all subdiagrams g # G, 
but is itself superficially divergent, w(G) > 0, then the divergent part of its amplitude 
is a polynomial of degree less or equal to w(G) in the external momenta P and 
in the internal masses. Indeed, since w(G) measures the degree of homogeneity 
of Ic(P) in the momenta and masses, the [w(G) + 1]-th derivatives with respect 
to the P; and the m, have a degree minus one, and hence are superficially con- 
vergent. By virtue of the theorem, the derivatives [6°*!/(@P)°* "VIe(P) [or 
(6°*'/dm®* ')Ig, or mixed derivatives of this order | are finite. 


That our proof of the theorem is still valid is an easy exercise left to the reader. 


Finally, we get 
lGcaglEs m, A) = Ten (P) a D(P, m, A) (8-29) 


where J;,,(P) is finite as the cutoff is removed and D is a polynomial in the P 
and the m of degree less or equal to w(G). 


8-2 RENORMALIZATION 


8-2-1 Normalization Conditions and Structure of the Counterterms 


We shall now come to grips with the renormalization operation itself. Let us 
once again repeat the general ideas and emphasize the outcome of this operation. 
Our aim is to express the proper Green functions in terms of renormalized 
Feynman integrals associated with the initial diagrams. This may be achieved 
by means of three equivalent procedures. In the first approach, the one presented 
in Chap. 7, we add to the initial lagrangian a formal series (in h) of counter- 
terms. This in turn amounts to an order-by-order redefinition of the parameters 
of the theory: the “bare” parameters appearing in the lagrangian are implicit 
functions of the renormalized ones. The former are unobservable and divergent 
as the regularization is removed, while the latter are the real finite parameters 
of the theory, mass, coupling constants, etc. Finally, we will see in the next sub- 
sections that these two procedures are equivalent to an algorithm of subtraction 
of the integrand. This operation, due to Bogoliubov, has the merit of providing, 
diagram by diagram, a finite result without any intermediate recourse to a 
regularization. 

We will try to juggle as skilfully as possible with these three equivalent 
approaches and to use the most appropriate one for each problem. For instance, 
we shall first discuss the recursive construction of the counterterms and then 
use the relation between bare and renormalized theories to stress the multiplicative 
character of the renormalization process. Finally, Bogoliubov’s subtraction opera- 
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tion will offer a convenient framework for an (heuristic) proof of the fundamental 
convergence theorem. 

The construction of counterterms proceeds by induction. We assume that 
the theory has been made finite up to a given order h*~' (i.e, a number L — 1 
of loops), through the introduction of judicious counterterms. According to power 
counting [Eqs. (8-13) to (8-18)], to the next order h*, only a finite number of 
proper functions have a nonnegative superficial degree of divergence. Except for 
possible cancellations arising from the existence of symmetries, we associate to 
each of them a local monomial in the fields and their derivatives. It 1s, of course, 
a Lorentz scalar; its structure reflects the nature of the Green function since it 
must contribute itself to the process under study and its extra contribution to 
this function to that order must cancel its divergence. There remains, of course, 
a finite arbitrariness. After introduction of a regularization, the coefficients of the 
counterterms are completely determined by normalization conditions, imposed 
on the superficially divergent Green functions. If these normalization conditions 
are satisfied to lowest order (the tree approximation as implied by the initial 
Lagrangian), demanding that the Green functions satisfy them order by order 
fixes unambiguously not only the infinite but also the finite part of the counter- 
terms. 

For instance, in the renormalizable g* theory of a scalar field of mass m, 
the two-point function I is quadratically divergent [w(I'?’) = 2]. We demand 
that the renormalized I’? satisfies 

TR’(m?) = 0 fa TR '(m?) = 1 (8-30) 
This is a natural condition for the physical mass, since the interpretation of the 
theory in terms of particles requires that the complete propagator G'2)(p?) = 
i[T'?(p?)]~* has a pole of residue i at p? = m*. To order L, the regularized 


function (@) already renormalized up to order L — 1 reads 


T2)(p?) = T2(p?, m?, A2) + p?AiPm?2, A?) + AF (mn?, A2) (8-31) 
where I'{))(p?, m?, A?) is finite as A? + 00, whereas A,I?(m?, A?) behaves at 


worst as a power of In (A*/m’), and A,I as A? [times In? (A?/m?)]. The 
counterterm to order L reads 


fey _ (0@)" Om 
AL"! = a 2 a bm? 7 (8-32) 

hence giving to [ an extra contribution of the form 
AT?) = ap? — bm? (8-33) 


Since the conditions (8-30) are already satisfied to lowest order, they yield to 
order L: 


Pp? = m?) = Tp? = m?) + (a — b)m? = 0 


8-34) 
d are : 
| — ae (| re an 


Op om Op? - 


— 2 
=m- 
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and therefore determine a and b. A similar normalization condition is imposed 
on the other superficially divergent function, namely, the four-point function of 
the y* theory. A physically sensible and symmetric condition consists in requiring 
that this function takes the value —/ (the renormalized coupling constant) at 
the on-shell (but unphysical) point S,,,: 


Pi=p2=p3=pa=m?> TR |s, = —-A 
a ree: (8-35) 
$= f= 
s) 
This is clearly in agreement with the lowest-order value F'‘#!°! = — 4. Analogous 


normalization conditions can and must be introduced in any renormalizable 
theory. 

Of course, the foregoing is by no means a proof that divergences may be 
disposed of by counterterms. In particular, we have not completely proved that 
o() = 2 implies that IY) has the behavior (8-31). Indeed, we are not really 
in the conditions of the corollary (8-29), since all subdiagrams are not necessarily 
superficially convergent, but may have been renormalized by lower-order counter- 
terms. That our reasoning is nevertheless correct will actually follow a posteriori, 
when we will have shown that this procedure does lead to a finite renormalized 
theory. Here we only want to recall the logic of the method and stress the 
necessity of normalization conditions. At this point, it may be worth recalling 
that the difference between renormalizable and nonrenormalizable theories lies 
in the number of these conditions. While in the former a finite number of con- 
ditions suffice to define the theory in terms of a finite number of renormalized 
parameters, the renormalization of the latter requires an infinite set of such con- 
ditions ; ultimately, a nonrenormalizable theory will depend on an infinite number 
of parameters. 

There is a great amount of arbitrariness in the choice of the normalization 
conditions. The only proviso is that they must be satisfied to lowest order, so as 
to fix unambiguously the subtractions to higher orders. We shall reexamine this 
point in Sec. 8-2-5 below. Owing to this arbitrariness, it may be more convenient 
to use a less physical, intermediate renormalization. When all the fields have a 
nonvanishing mass, it is safe to choose normalization conditions at the origin 
in momentum space. In the above example of the g* theory, the conditions 
will read 


d 112) 


TR eh) = 1 


T2\p2)| aap = —m?2 ae 
£ (P| p=0 dp? (8-36) 


ES (0.0.0 0-4 


The quantity m? as defined by (8-36) is no longer the physical mass, even though 
it is related to it. 

When dealing with particles of nonzero spin, the tensor structure of Green’s 
functions must be taken into account. It may happen that only the form factors 
of some tensors are divergent (for instance, in the study of the vertex function 
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in the last chapter, F, was found finite and F, divergent). Only the latter requires 
subtractions and normalization conditions. 

A consequence of the previous recursive construction of the counterterms 
concerns their structure. In a renormalizable theory, the counterterms satisfy the 
criterion of renormalizability. This is what we found in spinor electrodynamics 
to the one-loop approximation, where the only counterterms had the form 


(OnAy— vA): iw: Wa: AW 

Similarly, in the g* theory, the counterterms are :(09)*:, :97:, :@*: of degnee 
less or equal to four. In scalar electrodynamics, the situation is slightly different, 
since starting from the monomials generated by minimal coupling we find counter- 
terms of the same structure, plus a new term of the type :(g'g)*:. Therefore, 
we end up with a mixture of electrodynamics and ¢* self-coupling, but the theory 
remains renormalizable. This also means that scalar electrodynamics depends on 
a supplementary unexpected parameter, namely, the value of the four-point func- 
tion at a given point. 

More generally, if a proper diagram G of a renormalizable theory is super- 
ficially divergent, 


0<w(G)<4-3Er—Es—6 


[ we use Eq. (8-18), where w, < 4], the corresponding counterterm has Er fermion 
fields, Ez boson fields, and a number 6 of derivatives. Therefore, its dimension, 
in the sense of Eq. (8-17) is 


@,, = 3Ep + Ep + 6 <4— AG) (8-37) 


The counterterm thus generated has a dimension less or equal to four; hence 
it is also renormalizable. 

Whenever the counterterms have the same structure as monomials of the 
initial lagrangian, they may be considered as redefining the parameters of the 
theory. The quantities appearing in the lagrangian, resulting from the addition of 
counterterms to the initial monomials, will be referred to as bare parameters. 
The bare parameters are determined order by order in perturbation theory as 
functions of the renormalized quantities, so that the renormalization conditions 
are satisfied. We discussed this construction in the previous chapter for electro- 
dynamics (see also Sec. 8-4 below). In the case of the g* theory, we write the 
lagrangian plus its counterterms as 


i m? A 
L+AP = Lp=5 (69! — > P -F Q 


1 1 1 
+5 (Z— N0x9)? ~ 5 (Zim — m2) 9? — = (ZA - Doo" 
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1 ms A 
5 Ono)" — > 93 — 71% (8-38) 
where Qo = Z'@ 


This is sometimes referred to as the renormalized lagrangian, an unfortunate 
denomination, since its coefficients are infinite. The lagrangian “x has the same 
expression as the initial one, up to the replacements @ > @o, A > Ap, m> mo. 
We recall that in perturbation theory 


Ao(A, A/m) = A[1 + O(Ah)] 
mo(A, A/m) 
m 

Z(A, A/m) = 1 + O(A2h2) 


The last relation is particular to the g* interaction. A priori we would expect 
Z=1+ O(h). 

The fact that renormalization boils down to a redefinition of the parameters 
implies that unrenormalized (or bare) and renormalized Green functions are 
related through 


Geis ++e5Dn, M, d) = ZAG (ps, --+>Pn, Mo, Ao: A) 
TO (1, e »» Pas m, A) = r Aa) ** -> Dns Mo, Ao; A) 


= 1+ O(Ah) (8-39) 


(8-40) 


This relation holds for connected and proper functions respectively. Indeed, the 
connected renormalized functions are computed from the lagrangian p= + 
A by adding a source term jg, while the unrenormalized ones may be con- 
puted from #z (in the presence of a regularization) by coupling a source to 
Yo, that is, j~o. Taking n derivatives with respect to j leads to the announced 
relation. The limit A — oo is understood on the right-hand side of Eqs. (8-40). 
Finally, proper functions are obtained after elimination of one-particle reducible 
diagrams, an operation which commutes with renormalization, and amputation 
by the renormalized or unrenormalized propagator respectively—whence the 
change in the power of Z. 


8-2-2 Bogoliubov’s Recursion Formula 


In the preceding subsection, counterterms have been associated to proper func- 
tions, i.e., to a sum of Feynman diagrams computed to a given order. The 
existence of normalization conditions enables us to associate a counterterm to 
each superficially divergent diagram, computed using the requirement that it 
satisfies the normalization condition. A subtlety is involved when owing to can- 
cellations caused by symmetries (Bose or Fermi symmetry, internal symmetry, 
etc.) individual diagrams may be more divergent than their sum to a given order. 
In that case, it is safer to consider only sets of diagrams—gauge-invariant sets 
in quantum electrodynamics—that exhibit these cancellations. We shall avoid 
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this unessential complication in the sequel by restricting ourselves to a scalar 
nonderivative theory. é 

Let us now investigate the effect of lower-order counterterms on a given 
proper Feynman diagram G. Indeed, G may contain superficially divergent proper 
subdiagrams y: w(y) > 0. To each of these subdiagrams we may associate a counter- 
term of order h“: (L, being the number of loops of 7). Let %¢ denote the integrand 
of the Feynman diagram G in momentum space, #4; the one when all lower- 
order counterterms are taken into account, and #, the renormalized integrand 
that leads to a finite integral. If G is superficially convergent, 


Re=Re for w(G) <0 (8-41) 


However, if w(G) > 0, Bg differs from Ac by the contribution of the counter- 
term attached to G itself. Alternatively, #¢ has to be subtracted to yield Be 
that leads to a finite integral satisfying the normalization condition. After integra- 
tion over the internal momenta [(Z¢ — Zc) must be a polynomial of degree 
less or equal to w(G) in the independent external momenta of G. In the framework 
of the intermediate renormalization (subtraction at zero momentum), Ro — RG 
will be nothing but the Taylor expansion Tg Aco of Ac in the external momenta 
at the origin, up to the order «(G) included. We shall write 


Ro=(1-Te)Re for w(G)>0 (8-42) 


We are left with the problem of relating Ag to the initial integrand 4g. The 
difference between 2g and .%g comes from the counterterms associated with the 
renormalization parts y of G. Following Zimmermann, this refers to proper 
superficially divergent subdiagrams, hence containing all the lines of G that join 
two of their vertices, according to our convention of Sec. 8-1-4. The contribution 
of the counterterm associated with y is — T,#,, and when inserted in the diagram 
G in place of y this gives 


Fo1(—T,2,) (8-43) 


Here 7,2, denotes the Taylor expansion of the modified integrand R,, in the 
independent external momenta of 7, up to the order (7) (included). 


This Taylor expansion is not as well defined as in the case of G itself, since the distinction between 
internal and external independent momenta requires more care. We shall admit here that a definition 
of these external variables is always possible and refer the scrupulous reader to the references quoted 
in the notes. 


On the other hand, %¢,, stands for the contribution to the integrand of the 
lines and vertices external to y in the initial diagram. It contains in particular 
the propagators pertaining to the lines that join y to the rest of G. (See Fig. 8-3 
for an illustration.) Equation (8-43) contains the contribution of the counterterm 
relative to y (and possibly of those contributing to R,). It is now easy to write 
the contribution of two counterterms relative to two disjoint renormalization 
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Figure 8-3 Subtracting an internal renormalization part » in the integrand ¥. 


parts y; and yz, where disjoint means that they have no common line or vertex: 
¥1 Ay2 = ©. This contribution reads 


Feit, ale T, ‘a (5s 6. 2,,) (8-44) 
where .% .., 


Gity.y, Tefers to the contribution to the initial integrand of all lines and 
vertices of G except for those belonging to y; or y2. The reason why we only 
consider disjoint renormalization parts is that otherwise both could not be replaced 
by their corresponding counterterm. This enumeration of the possible contribu- 
tions to #¢ may be pursued. The resulting recursion formula, due to Bogoliuboy, 
reads 


Re=Iotr+ Y | Foityssondsd I] (—T,,2,,) (8-45) 
UV taeres Ps as 
YaNVo= OD 


The first term on the right-hand side is the initial integrand, and the sum runs 
over all families of disjoint renormalization parts. Equation (8-45) after iteration 
gives, together with Eqs. (8-41) and (8-42), the expression of the renormalized 
integrand. If G contains no superficially divergent proper subdiagram, then 2g = 
4. This was the case of all diagrams studied in Chap. 7. For one-loop diagrams, 
either w(G) < O and &, = 4g, or w(G) > O and & = (1 — Te) %o. 


Let us now present simple examples involving more loops. For the sake of simplicity, we consider 
the theory of a scalar field y endowed with the interaction ¥,, = —Ag*/3! in a six-dimensional 
space-time. Though such a theory suffers from serious pathologies—its hamiltonian is not bounded 
from below—it does make sense to develop its perturbative expansion. It is a renormalizable theory 
(in six dimensions). The power counting of Eq. (8-18) is replaced by w(G) = 6L — 21 = 6 — 2E. Consider 
the diagram of Fig. 8-4. The subdiagram y inside the dotted box is the only renormalization part, 


G/y 


Figure 8-4 Nested divergences. 
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Gly, G/y, 


71 G Uf) 


Figure 8-5 Overlapping divergences. 


besides G itself. According to (8-45), 

Tt, + Mey —TR,) 

R, = 5, 
whence we deduce & 

Ro = 5; Mag TS) 

=(1 — ee 
with a slight abuse of notations, and, following (8-42), 
Rg = (1 — Te) Re = (1 — To)(1 — T,) Fo (8-46) 


We see that to the nested diagrams y < G have been associated two factors (1 — T). This would 
still be true if we were to add more (parallel) rungs to this diagram. It is a good exercise to verify 
that the renormalized integrand leads to a finite integral. 

We turn to the diagram of Fig. 8-5. It is superficially divergent [w(G) = 2]. as well as its two 
renormalization parts y, and y2 [w(y,) = w(y2) = 0]. Here a new phenomenon arises; +; and ;'2 are 
neither nested nor disjoint but overlapping. They share one line and two vertices, but none is included 
in the other. Consequently, Eq. (8-45) leads to 

Re = Igt I G1y(— T, ,2,) ar I Giy:(— Te) 

2, = J, 

&R,, =J,, 
If we write T, %¢ = 4), T, %, we finally have 

Re =(1— Ty, — Ty.) 4% 
= (8-47) 
Re = (1 — Te)Re = (1 — Te)(1 — T,, — T,,).%e 


We observe that #g is not equal to (1 — Tg)(1 — T,,)(1 — T,,)%G. The supplementary terms, namely, 
(1 — Tg)(T, ,T,,), just correspond to overlapping subdiagrams. 

This diagram of Fig. 8-5 will be analyzed further in the case of four-dimensional quantum 
electrodynamics in Sec. 8-4-4. 


8-2-3 Zimmermann’s Explicit Solution 


The preceding examples suggest the general solution of the recursion equation 
(8-45). Following Zimmermann, a forest of renormalization parts will be defined 
as a family U of proper superficially divergent subdiagrams y such that 


ify,andy,eU for yoy, (8-48) 
or WAV =O 
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We recall that y; ~ yz = @ means that these subdiagrams have no common vertex 
nor line. A forest may be empty. Moreover, we denote by V the forests of G 
such that, if G is itself superficially divergent, G does not belong to V. Of course, 
if G is not superficially divergent, the two sets of forests {U} and {V} are identical. 

Ifa consistent set of internal momenta has been chosen to make the operations 
T, meaningful, two such operations pertaining to disjoint );, y2 commute, whereas 
ify: < y2, we understand that T,, will stand to the right of T,, [compare the 
examples of Eqs. (8-46) and (8-47)]. Under these circumstances, let us show that 


Re I] (-T) 4% 
ye 


2 I] (—T,)F%o 


yeU 


I 
~ 


- (8-49) 


is the solution of Eq. (8-45). First, we remark that to the empty forest corresponds 
the term %¢. Second, if G is a renormalization part, we may associate to every 
V two forests of the U type, U; = V and U, = {G} UV. Therefore Eq. (8-49) 
leads in this case to 2g = (1 — Tg)#g whereas if w(G) < 0, Bg = Re by definition 
of the V. In either case, this is in agreement with Eqs. (8-41) and (8-42). To 
prove that Eq. (8-49) is indeed the solution to Eq. (8-45), we proceed by induction, 
assuming that it has been proved for arbitrary diagrams up to a given number of 
loops L— 1. If G has L loops, let us write Eq. (8-45) and insert Eq. (8-49) for 
each #,. We get 


Re = Igt+ De S eueipes co [] ae bs I] (- Ss, (8-50) 
{Yasseen Ps} a=1 ' Ly y'eV,, 
YaNWy=O 


The notations are cumbersome but obvious. It is then a matter of a simple 
inspection to check that Eq. (8-50) generates all the terms of (8-49), corresponding 
to forests of extremal elements y,,..., ,. By definition, an extremal element of a 
forest is an element y not included in any other one of the forest. The summation 
over the disjoint sets {y,,..., ys} generates all the forests of G once and only once. 


We have been discrete about the choice of normalization conditions. Since the nature of the Taylor 
subtraction depends on this choice, the most convenient one is the intermediate renormalization 
defined above, i.e., the subtraction at zero momentum. When a different point is chosen, for instance, 
in massless theories where subtracting at zero momentum is forbidden {see Sec. 8-3-1), some care 
is required to maintain Lorentz invariance. 

Supplementary subtractions, i.e., beyond the degree w(G) prescribed by power counting, may be 
performed consistently in an analogous way. We shall not develop this point here, and refer the 
reader to the references (see also some remarks at the end of Sec. 8-2-6). 

The construction of # has involved only disjoint renormalization parts, i.e., without any common 
vertex or line. As a point of consistency, let us show that the renormalized integrand for an articulate 
(or one-vertex reducible) diagram may be factorized (Fig. 8-6). As an example, we consider the diagram 
of Fig. 8-6b of the four-dimensional g* theory. The forests U are 


D, {yi}, {ya}, {Y15 G}, {v2 G}, {G} 


Hence 
Rg =(1— Te — T,, — T,, + TeT,, + TeT,) Fy, 4y2 
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= a 


(a) (b) 


Figure 8-6 Articulate (one-vertex reducible) diagrams. 


For such a diagram, Tg = T,,T,,. Since G, y:, and yz have the same degree of divergence, and since 
a Taylor expansion of degree w does not affect a polynomial of degree w, we have 
Tg = TeT,, = TeT,, = 7,7, 
Thus, &g also reads 
Re =(1 — T,,)4,,(1 — T,) 4, = 2, By, 


This proof is readily generalized. 

It is straightforward to verify that in the case where all the renormalization parts of G are 
nested, Eq. (8-49) reduces to the product of operators (1 — T,) over all the y. The general case consists 
in expanding the same product and omitting all terms corresponding to overlapping subdiagrams. 
These two statements are a mere generalization of the properties found in the particular examples at 
the end of Sec. 8-2-2. 


The result expressed in Eq. (8-49) is a major achievement but is not com- 
pletely satisfactory until we prove that it leads to a convergent integral. Especially 
embarrassing is the case of overlapping divergences. Is the prescription contained 
in Eq. (8-49) really sufficient to provide a finite expression in such instances? 
As we shall see, the answer is “yes.” The proof of this result will render the use 
of an intermediate regularization unnecessary since the subtraction of the inte- 
grand leads to aconvergent Feynman integral. However, it is often more convenient 
to deal with regularized amplitudes rather than with the cumbersome expression 
(8-49). Before giving a sketchy proof of convergence, we will first discuss the form 
of the subtractions in the parametric representation. 


8-2-4 Renormalization in Parametric Space 


It is possible to reexpress the subtraction prescriptions in the parametric representation. This formula- 
tion allows a simpler proof of the convergence theorem. We shall outline the major steps. 


Asa preliminary example, consider again the one-loop self-energy diagram (Fig. 8-7) of the six- 
dimensional ¢? theory. Its value in the euclidean region is 


e~ Im*(a + a2) + O(P,a)] 


oe fea} 
pa {, da, da Pa 


with P(x) = 0, + a2, O(P, «) = [o02/(01 + a2)|P*. We find a quadratic divergence due to the non- 


Figure 8-7 The bubble diagram. 
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integrability at the origin of the parametric space. If we use the intermediate renormalization, the 


renormalized form will be obtained after subtraction of the truncated Taylor expansion in P? at the 
. origin 


dl 
Ip(P?) = I(P?) — 1(0) — P? —— 
a(P?) = I(P?) — 1(0) — P? = 
co e) ™*(a1 +42) Sete d 
=i a pe MARE cara.) Gea) 


Under a simultaneous dilatation of the parameters «,, x by a factor A, the subtracted integrand 

will behave as A~'. Owing to the extra factor 4 arising from the measure da, da, the integral will 

be convergent at the origin. Now the subtraction in P*? may be transmuted into a subtraction over 
} the parameter 4, if we recall the homogeneity properties of Q and Y and observe that 


Q(oP, «) = O(P, p?a) 


and [P(a)]}~? Ge — e~ X02) _ p2 = lt 
2 1 oa" 
= [P(a)]~ > eo AP) — = a - {[P(a)]~ 3 J estaedog oes 


2 
= [Pla] 9 err 


= = aniye e APP) T Ppa) |- aN oro (8-52) 


This explicit example leads us to the following definition. Let f(g) be a function of p such that 
p’f(p) is differentiable at the origin for some integer p. We define the operator 7 as 


k+pi s 
7s i i p* 1 
Pfipyap=” 2% at ap [e"'f(e)]p=-0 Wrri=p (8-53) 


with k integer. It is straightforward to check that this definition does not depend on p, > p and 
that it could be generalized to p noninteger. The expression (8-53) makes sense provided k + p > 0, 
which allows k to be negative. By convention, we set 


F*=0 ifk+p<0 
The essential property of this generalized Taylor expansion is that 
(1 — F*) f(p) ~ O(p**?) (8-54) 
In the previous example, the renormalized integrand reads 
(1 =F gSaiaiipea)]} > POY 


where the notation implies that the subtractions are performed at p = 0 and the function is then 


evaluated at p = 1. 
These operations are generalized to arbitrary situations. Returning to a four-dimensional theory, 
the translation of formulas (8-49) yields the following renormalized amplitude: 


Ip(P) = a ~~ I] (da, € — 42) R(P, or) 
(8-55) 
A(P, a) = y ut i VATE (a ican ope 


The a? p, acts on the parameters «, pertaining to the renormalization part y, after a rescaling 
at; > p20; I, denotes the number of internal lines of y. The subtractions are performed at p, = 0 and 
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D 


Figure 8-8 Overlapping divergences to be studied in parametric space. 


the result evaluated at p, = 1. Finally, these operations are performed on all the ; belonging to a 
forest U, and a summation over the U is carried out (U = @ corresponds to the identity with no 
subtraction). In Eq. (8-55), we have taken for simplicity a scalar theory without derivative couplings, 
considered in the euclidean region and renormalized at the origin in momentum space. The generaliza- 
tion to nonzero spin, derivative couplings, higher dimensions, etc. offers no difficulty -only for 
notational ones. 

This formulation of subtractions in parametric space enjoys remarkable algebraic properties. 
It may be shown that the complete expression (8-55) is independent of the ordering of the Taylor 
subtractions, although two particular 7 pertaining to overlapping subdiagrams do not commute in 
general. Moreover, for a given diagram, or a finite set of diagrams, there exists an upper bound on 
the powers p. Consequently, the operators 7, depend only on the number of internal lines J., 
and no longer on the particular topology of the diagram. 

Even the last reference to the topology in (8-55), namely, the enumeration of forests and re- 
normalization parts, may be disposed of. First, we prove that 


AP, ow) = T1(1 — F524) {[ Pla]? e- 87} |, 24 (8-56) 


where the product runs over all the renormalization parts of the diagram. In other words, the sub- 
tractions pertaining to overlapping subdiagrams drop out. As an illustration let us verify this property 
on the example of Fig. 8-8. In four-dimensional space, the diagram G together with the subdiagrams 
¥1, Y2, and y are renormalization parts. Showing the equivalence between Eqs. (8-55) and (8-56) 
amounts to proving the identity 


= el — Fer NeaT 5?" ja = ay 21 )(— 7 5 **)\ iam e poe a 


where the dilatation parameter for G has been denoted /. If f stands for the function between brackets. 
it satisfies ; 
f= f(Apip2p, Api, Ap2) 


since only those « belonging to y are simultaneously dilated by p? and p3. Therefore, the action of 
7, yields 
—F "f= VY (pad falorp. Ap2) 
k<-2h, 
(The lower bound on k depends on the theory; in a scalar theory it is —4L,, in terms of the number 
of loops of y2.) Then 


(La, -F) f= > wees euel elion 
k<—-21,, 
m>—2h, 
and ; 
(-F, = Fp eZ, i= Y Cee Cae.. 

k<-2h,, 
m>—2I, 

HERTS 74 b 


Each individual term in the sum is homogeneous of degree k + n in J. But 
k+ns —-2(h,+4,)-—ms -2(L,+ ],—-1)= —2le 


Therefore the action of the last subtraction (1 — 7; 2's) which retains only the terms of degree 
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larger than — 21g gives zero. It is a mere exercise of bookkeeping to generalize this proof to arbitrary 
Situations. 

Finally, it is easy to show that if we perform subtractions on a homogeneity scale of subsets of 
parameters that do not correspond to superficially divergent proper subdiagrams, their effect drops 
_ Out in the complete expression. Therefore, the final result of this analysis is 


RP, a) = [] (1 — Fyz2)[P-2(a) e MP], (8-57) 


where the product runs over the (2% — 1) nonempty sets of « parameters. Again, the result does not 
depend on the ordering. The merit of this last expression is twofold. First, it is independent of the 
topology of the diagram. Second, it enables us to understand, at least qualitatively, the arguments 
for the proof of convergence. Indeed, for any family g of parameters w, we may bring the operator 
a —E. *44) to the left of the product in (8-57), and it follows from (8-54) that the integrand behaves as 


p,”'’"* and hence is integrable in p, since the measure still contains a factor pz"! dp,. That the 


possible singularity of an arbitrary subset of the « is integrable is unfortunately insufficient to insure 
the convergence of the integral. For instance, in the integral 


i 1 2a, 
da, da, —--—— 
\, [ Be rial) 


the singularities corresponding to x, > 0, x. # 0, or to x2 > 0, a, ¥ 0, or to %, ~ & — 0 are integrable, 
but integration over «, leads to the divergent integral 


|, da 
0 %2(1 + a2) 
The proof that such phenomena do not occur for Feynman integrals is tedious. We will not reproduce 


it here. As in Sec. 8-1-4, we have to divide the integration domain into sectors and appeal to the 
homogeneity properties of the parametric functions. 


We conclude this long and technical analysis by stating once again the 
important result of the Bogoliubov-Parasiuk-Hepp theorem. The subtraction 
operation described in Eqs. (8-45), (8-49), or (8-57) yields an absolutely convergent 
integral, and defines an analytic function of the momenta in the euclidean region 
and a tempered distribution in the Minkowski domain. 


8-2-5 Finite Renormalizations 


We have considered so far the subtractions of infinities. But the previous develop- 
ments on the construction of counterterms, on the multiplicative character of 
the renormalization, or on the algebra of subtractions apply as well to finite 
renormalizations. This term refers to the operations required to modify the 
normalization conditions, thereby changing by a finite amount the (renormalized) 
parameters of the theory. Such is the case for instance, if we want to modify the 
normalization conditions (8-30) and (8-35) into (8-36). More generally, sticking 
to our o* theory, let us consider the following set of renormalization conditions, 
depending on an arbitrary mass scale p: 


é 
DR (p”) p= = Vi —m ap? TR (p”) 


Pk’ |s,= —A 


where S,, is defined as in Eq. (8-35) but with m replaced by uw. 


a | 


(8-58) 
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This is a perfectly sensible choice since it is satisfied to lowest order, and it interpolates between 
conditions (8-30) and (8-35), and (8-36). It is, however, safer to choose p such that the renormalization 
points p> = p? and S, lie inside the analyticity regions of the two- and four-point functions respec- 
tively. Otherwise, the above condition should be understood to hold for the real part of the amplitude 
only, 


The theory depends now on two mass scales: m is the mass entering the 

propagator in Feynman diagrams and p specifies the renormalization point. As 
for the physical mass, defined as the pole of the complete propagator, it is some 
function of m, », and 4, and may be computed order by order in perturbation 
theory. Also, the residue at the pole is no longer one, and the computation of S 
matrix elements should take it into account. 
How are two renormalized theories corresponding to two different choices of 
u related? Clearly each one may be reconstructed from the other through the 
introduction of finite counterterms, determined order by order to implement the 
new conditions. As in the case of the infinite renormalization, this is in turn 
equivalent to a redefinition of the parameters m and » of the theory, provided 
we also allow for a finite wave-function renormalization of the field. These 
parameters being equal to the value of the Green functions at a given point y, 
changing p into yw’ amounts to changing the parameters m, 4 (and 1) into m’, 
A’, and z. Hence 


Diets -s Dat Ay) = sO (Di cape, (8-59) 


where m’, 4’, and z are functions of m, 2, u, and yu’, computable order by order 
in perturbation theory. 


This will be illustrated on the two-point functions of the gy? theory in a six-dimensional space. This 
renormalizable theory has the merit of having a nontrivial wave-function renormalization to the one- 
loop approximation. If ¥,,, = —A/3!:@°*:, the self-energy of Fig. 8-7 reads 

d°k | 

(2n)° [(P — k)? — m?](2 — m?) 


Pp?) = — 2? | 


where some regularization is understood and the superscript | refers to the one-loop correction. A 
straightforward computation yields 


(42)? Jija2 p? Jo 
el as {. doa? (1 — a)? 
(@P?)? (4m)? Jo m? — a1 — a) P? 


2 fa) 1 
pl p2) — A | a da e7 Pim? — a(1 —ayP*] 


and 


The renormalized function P'@" satisfying (8-58) reads 


' j2 1 re i 2 
re (p2) = oe { do {bm = o(1 — «)P?] In aoa + a(1 — a)(P? — a} (8-60) 


where we do not bother to perform the « integration explicitly. If we now change pe into yp’, it is 
easy to see that (2) = P? — m? + T2"" satisfies Eq. (8-59) with 
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# m? — o(1 — a)? 
imagen? + d ja- pel Us a ae 
m m (np \ a) (l—a + a*)m eae ar)(u? — 1’) 
(8-61) 
gz ft} m? — o(1 — op? 
=1+—| daa(l — a) In ———___—. 
ny I Be = ll Oe 


The function 4’ would be determined by the calculation of the one-loop three-point function. This 
will be left as an exercise to the reader. 


This may be generalized to arbitrary conditions, in any renormalizable theory, 
with a conclusion similar to Eq. (8-59). The information conveyed in Eq. (8-59), 
namely, the equivalence of changing renormalization points or renormalized 
parameters, is referred to as renormalization group invariance. The implications 
of this equation or of its infinitesimal form—the so-called renormalization group 
equation—will be studied in Chap. 13. We shall see that the innocent-looking 
freedom in the choice of normalization conditions has nontrivial and important 
consequences. 


8-2-6 Composite Operators 


The Green functions considered so far involved only elementary fields, ie., 
dynamical variables entering the lagrangian. The renormalization procedure per- 
formed either through Bogoliubov’s subtraction R operation or through the intro- 
duction of counterterms extends to a wider class of functions involving composite 
operators. By this we mean local monomials of field operators and their derivatives. 
A prototype of such an operator is the electromagnetic current Wy,. Composite 
operators play an important role in many developments. 

For the sake of simplicity, we shall again discuss only the scalar y* theory. 
Composite operators are of the form 9, (a9), pHa, ¢%, @°, etc. but also 
00,0, Ou~ Ov,..., if we construct vector- or tensor-like operators. For power 
counting, these operators have clearly the dimensions w; = 2, 4, 4, 4, 6,..., 3, 4,... 
respectively. To deal with Green functions containing insertions of these opera- 
tors O;(x), it is convenient to add sources y; coupled to them in the action. 
Consequently, 


Zi 0| Pexp : [as [ (x) p(x) + (so |O> (8-62) 


will generate these new functions. Connected Green functions will be obtained 
from the logarithm of Z, as in Eq. (6-71), while the Legendre transformation of 
Chap. 6, performed on the source j only, will generate the proper functions, 1.€:, 
one-particle irreducible but with an arbitrary number of O; insertions. When 
restricting our attention to a finite number of those, we shall perform a finite 
number of derivatives with respect to the y; and then set y; = 0. 

If we consider a diagram with N insertions of operators of dimensions @;, 
a mere application of Eq. (8-13c) reveals that the new superficial degree of 
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divergence w’ differs from the one in the absence of insertions by an amount 


N 
wo’ -wa=) (a - 4) (8-63) 
1 
Insertions of operators of degree less or equal to four in a superficially con- 
vergent diagram preserve the convergence, whereas insertions of degree larger than 
four deteriorate the power counting. However, whatever the (finite) number of 
insertions of composite operators, there exists a subtraction prescription or, 
equivalently, counterterms that make proper Green’s functions finite. For instance, 
assume that the two-point (two external @) function 


€T04(x1)O2(x2)°** On(xw) P(Y) PLZ) sroper 


is superficially divergent with degree m’. There exists a local counterterm, quadratic 
in g and proportional to y;(x):--¥n(x), with a polynomial of derivatives of degree 
less or equal to w’. Clearly this counterterm will contribute only to the function 
(8-64). This will now be exemplified in simple instances. 


(8-64) 


1. Insertion of the @? operator. This operator has a dimension w(p?) = 2 and 
therefore its insertion improves power counting. There are two superficially 
divergent proper functions with @” insertions. They are depicted in Fig. 8-9. 
Both have w = 0. To the first, we associated a counterterm quadratic in 7 
but @g independent, while the second requires a counterterm of the form 
47. The original term $y@? in the lagrangian is thus changed into 


2(1 + 61)x9? + 52x” 
with 6,;, 62 two divergent scalar quantities. If we rewrite 
1 ae or = Late 


as a definition of Z,. in terms of the wave function renormalization Z of , 
we see that n-point functions with a single y insertion satisfy 

Tt p(45 Pis+++s Pn» Am) = ZZ"7T (G5 Pis--+>Pn» 40,™o) (8-65) 
Here q stands for the momentum entering the diagram at the @? vertex. We 
see that gy” is multiplicatively renormalized, as @, with a wave-function re- 
normalization constant Zy:. The insertion of several operators is a straight- 
forward generalization 


I ee Be q2; P15-++>DPn: A, m) = 
Lig Oe pet, (q1, q2 q Pi, tty Pn hos Mo) ain 2626n,0 (8-66) 


(a) (d) 


Figure 8-9 Divergent diagrams with 
insertions. 
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where the last term takes account of the vacuum diagrams (n = 0) of Fig. 8-9a. 
Of course, these new subtractions require new normalization conditions to 
be fully specified. For instance, we may impose conditions at zero momentum 
(2) (9. (0) 
pie p0;0)=1 Vite? (0, 0) = 0 (8-67) 
in agreement with lowest order. 
The renormalization of this y* operator is not independent of mass renormalization. Since the 


? 


addition of a term 3x(x)y7(x) in the lagrangian may be regarded as an x-dependent variation of the 
mass m? -» m? — x(x), it follows that 


Iya; p;4,m) = —Z,: 


é 
z THRs A m)| 24 


om 


where the derivative is taken at fixed Jo. Therefore, if subtractions are performed at zero momentum 
as in Eqs. (8-36) and (8-67) we conclude that 
Omz 


2 ee 
i Om? I 


(8-68) 


‘0 


2. Insertion of the ~* operator. The insertion of operators of dimension four does 
not affect power counting. The counterterms linear in the source y 9s, that is, 
counterterms relevant for a single insertion, will be combinations of {0;, 
i=1,...,4) = {o7(x), 97(x), (29)*(), ep De(x)} (the last two are equivalent 
when integrated over x, but differ in general). These operators are in fact 
mixed by renormalization. Consequently, @* may not be considered inde- 
pendently from those. To be more specific, the generalization of Eq. (8-65) reads 


TO..R(45 P1y-+-s Pn A,m) = YZ"? ZiT OG) reg(45 P1s---» Pn Ao, Mo) (8-69) 
E | td 
This amounts to saying that the renormalized insertion of ~* requires the follow- 
ing source term in the lagrangian: 


a 2 4 6 2 oO 
fot| Za + Zl? 4 aad! o + 224 ope 


where the Z,; are determined after introduction of suitable normalization 
conditions. Finally, from the previous study, we know that ¢7 is multiplicatively 
renormalizable, which means that Z,; = 0 for j # 1 and Z,; = Zg:. 

More generally, in a renormalizable theory, a complete set of composite 
operators O; of dimension less or equal to a given number D, and with the 
same quantum numbers, is multiplicatively renormalizable in the previous 
matrix form, at least as long as we deal with a single insertion. Moreover, 
Z;; = 0 if dim O; < dim O; (the matrix Z;; is not symmetric). Both results are 
obvious consequences of the power counting of Eq. (8-63). 


As an exercise, the reader may investigate the relation between the Z;; for dimension four operators 
and the bare constants Ao, mo, Z. He or she may also carry out the analysis of the renormalization 
of dimension six operators, or the one of the tensor operators, such as the energy momentum tensor. 
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In some instances, it may be interesting to assign to an operator a dimension greater than 
the one coming from dimensional counting, and to renormalize it accordingly. For instance, if instead 
of considering g? as an operator of dimension two we assign to it the dimension four, more sub- 
tractions (and normalization conditions) will be necessary. The new (or “hard”) operator, denoted 
N4(@7) to distinguish it from the old (or “soft”) one N2(@2), will be on the same footing as 9%, 
(d)7, and y Og. In particular, Z,; will no longer vanish. This generalization, useful in some applica- 
tions, has been introduced by Zimmermann. 


In the case of several insertions, with the rule (8-63) as Ariadne’s clue, we 
find that counterterms (multilinear in the sources) keep dimension four as long 
as the initial operators O; have dimension four, but that their dimension increases 
as dim (O;) > 4. For instance, a double insertion of g* and g (1 requires again 
v?, 9*, pM, and (dg)? counterterms, while the insertion of g* and ¢°® also 
involves all operators of dimension six and a double insertion of y® leads to 
counterterms of dimension eight! 


8-3 ZERO-MASS LIMIT, ASYMPTOTIC BEHAVIOR, 
AND WEINBERG’S THEOREM 


Until now, we have sidestepped all problems related to massless particles. The 
object of this section is to present an heuristic discussion on some aspects of 
these questions and their relation with Weinberg’s theorem. The latter deals with 
the behavior of Feynman diagrams as external momenta become very large. 


8-3-1 Massless Theories 


When dealing with massless particles in the internal propagators. we face the risk 
of encountering new divergences. We are going to show that nothing of this sort 
happens as long as we keep away from some particular values of the external 
momenta. In the euclidean (i.e, Wick-rotated) version of the theory, when all 
vertices of the lagrangian have a dimension four (or higher) the proper functions 
are finite at any nonzero and nonexceptional value of the external momenta. 
Nonexceptional momenta are configurations such that no partial sum of the 
incoming momenta p; vanishes. The integrations remain finite for small values of 
the internal momenta because the external momenta provide a lower cutoff. 


This result breaks down in theories containing super-renormalizable couplings. For instance, the 
diagram of Fig. 8-10a with one g” vertex (or, equivalently, a @ insertion of zero momentum) is 


(a) (b) 
Figure 8-10 Infrared-divergent diagram. 
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infrared divergent: 
d*k 


eee | (2)(P — 


This amplitude could also be considered as a six-point function in a @* theory (Fig. 8-106), but 
then it is evaluated for an exceptional configuration. 


A rigorous proof requires careful slicings of the integration domain, very 
much as for the theorem of Sec. 8-1-4. Here we shall just give a simple argument 
based on power counting, discarding possible ultraviolet problems. 


Infrared power counting amounts to finding out how many internal momenta may become soft in 
the diagram while preserving momentum conservation at each vertex. Let us consider an N-point 
function with nonvanishing external momenta, and assume that a definite flow through the diagram 
has been chosen for these hard external momenta. Since the latter are not exceptional, any internal 
line irrigated by them cannot have a vanishing momentum, as all loop momenta go to zero. More- 
over, the same property shows that these hard internal lines form a connected pattern on the diagram 
(the heavy lines in the example of Fig. 8-11). Consequently, as far as infrared behavior is concerned, 
we can contract them into a single vertex. To this vertex are attached N hard external lines and 
i internal lines, with i > 2 since the diagram is one-particle irreducible. Let J, L, V3, and V4 denote 
the number of internal lines, loops, three- and four-point vertices respectively in the contracted 
diagram. We have the usual topological relations 


D1 aves V, + 1) 
N+ 21 =3V3 +44 +(N +3) 


Since, by assumption, all vertices of the lagrangian have degree four, one power of momentum is 
attached to each three-point vertex. Therefore, the degree of homogeneity of the contracted diagram, 
which gives the superficial degree of infrared divergence when all internal loop momenta are simul- 
taneously small, is 
@;, = 4L—21 + V3 
=i22 

At least heuristically, this ensures infrared convergence. 

We might wonder whether relaxing the hypothesis that all internal loop momenta after contraction 
are small would not upset the previous power counting. For instance, let us assume that a number 


Iq <I of internal momenta are kept hard and that they form a pattern with Ly loops and V3 
and V4 vertices of each species. It would seem that w,, is changed into wi, = @, — Aj,: 


Aw, = 4Lu + Van — Ulu 


and that we might be in trouble. Fortunately, this is not the case. The hard pattern may be con- 
sidered as a diagram (possibly disconnected), the external-lines of which are all soft by construction. 


Figure 8-11 Flow of hard momenta through a diagram, and the corresponding contracted diagram. 
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Thus it behaves as a power Aw,, in these momenta. Therefore «,, is not affected. This is not sur- 
prising since «w;, is the homogeneity degree of the diagram. 


We may show even more. When a single momentum p vanishes, the others 
being nonzero and nonexceptional, Green functions remain finite. 


The reader will find no difficulty in extending the previous simple argument to that case and in 
showing that «;, is at most reduced by one unit. 


This crude argument has neglected possible ultraviolet problems. In order 
that renormalization does not upset the result it is mandatory to choose judicious 
normalization conditions. Subtractions at zero momentum must be avoided, since 
Green’s functions are generally divergent at that point. In a zero-mass theory, 
it is safe to choose renormalization points at euclidean values of the momenta, 
for example, p? = — py? < 0, instead of Eq. (8-34) or (8-36). The necessity of intro- 
ducing a non-zero normalization point implies that a theory where the physical 
mass parameters vanish involves nevertheless a mass scale uw. The independence of 
the physical quantities with respect to this arbitrary choice leads to renormaliza- 
tion group constraints, to be discussed later. 

The previous considerations apply as well to theories involving both massless 
and massive particles, such as quantum electrodynamics. Green's functions are 
finite at any nonzero and nonexceptional euclidean values of the momenta. When 
more than one external momentum vanishes a case-by-case analysis is required. 

In summary, if we let all (or some of) the internal masses of a Feynman 
diagram go to zero, we do not encounter any singularity, provided: 


1. All the vertices have degree four. 

2. The external momenta are not exceptional. 

3. There is at most one soft external momentum. 

4, Renormalization has been carried out at some fixed euclidean point. 


What happens when external momenta are continued from euclidean to 
physical on-shell values is a harder question. A corollary of the previous theorem 
is of interest. Consider a proper two-point function and assume that analytic 
continuation to the minkowskian region may be performed, avoiding threshold 
singularities. The Green function remains finite, and so does its absorptive part. 
Owing to the Cutkosky rules (see Chap. 6), this means that any total decay rate 
to final states involving massless particles is finite. This result due to Kinoshita 
is to be compared with a theorem proved by Lee and Nauenberg. According 
to this theorem, any transition probability in a theory involving massless particles 
is finite, provided summation over degenerate states is performed. This is, of 
course, what we found in the examples of Chaps. 4 and 7, where we were mainly 
concerned with soft emission. It should be understood that additional divergences 
may occur when energetic, collinear, massless particles are produced. These cases 
are also included in the previous discussion. 
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8-3-2 Ultraviolet Behavior and Weinberg’s Theorem 


We look for a precise relation between the superficial degree of ultraviolet diver- 
gence «(G) and the behavior of Feynman integrals when all the external momenta 
are large. Rescaling all these momenta by a common large factor J is equivalent, 
on dimensional grounds, to dividing all internal masses by the same factor 2. 
Therefore, the problem is closely related to the massless limit considered in the 
previous subsection. 

For simplicity, consider once again a scalar theory without derivative coup- 
lings. We select a Green function evaluated at euclidean external momenta and 
restrict ourselves, for the time being, to the ultraviolet convergent case. After 
integration over a global homogeneity variable, Eq. (8-20) becomes 


a aoaiit [Zumt + OP, a]? 
Wey ee it eee LE, oS 
c(P) (an Ff [l dor 5(1 — Yar) Esti 


If the P are dilated, P— AP, the integral behaves as 4° [remember that 
«(G) < 0 by assumption] provided that 


: Leleux)? 
[Tas oa ~ Ea ~ Pal 


converges. This in turn depends on the existence of a zero-mass limit of I¢(P). 
We see in particular that the large-/ behavior is related to the configuration of 
the P. From the preceding subsection, we know that if the euclidean momenta 
are nonvanishing and nonexceptional, such a limit will exist. In this case, the 
asymptotic behavior is given by the ultraviolet power counting 

Ig(AP) ~ av (8-70) 

A> 

This result extends to the case of superficially divergent, but renormalized, dia- 
grams. However, the power behavior may be modified by powers of logarithms 


Qmax 


Ig(AP) ~ A*? ¥" Cin 4A [1 + O(A-")] (8-71) 
0 


If the zero-mass limit is ill defined, e.g., if the diagram contains vertices of 
degree less than four, and/or exceptional external momenta, we expect some 
departure from this behavior, namely, higher powers than 1% and possible 
logarithmic corrections. 


To illustrate this point, consider the one-loop diagram of Fig. 8-12 with n + 2 lines and vertices: 
n of these vertices represent mass insertions. The superficial degree is w(G) = —2n. The corresponding 
(euclidean) integral] reads 


d*k if 
(2n)* (k? + m?)"*![(P + k)? + m7] 


I¢(P) = 


1 


a 1 2 —_ 21-2 
=a |, da «"[m? + a(1 — a)P*] 
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je 
~ eal 
1 n 
2 n-1 
k 
Figure 8-12 A self-energy diagram with n @? insertions. 
As P? + oo, Ig(P) behaves as 
1 Pp? 
5 In —2 ionu— 
Ig(P) ~ 
P2+0} m? 1 
— — for n> 1 
Pp? (m?)" 


A superficial estimate would have been (P*)~". This may be interpreted by saying that the large 
momentum flows through a single propagator (the upper one in Fig. 8-12), giving a power (P*)~?. 
The coefficient arising from the lower chain of propagators, J|? d*k(k? + m?)"*!, behaves as In P? 
for n = 1, or as a constant for n > 1. 


In the general case studied by Weinberg, the large-/ behavior is related to 
the minimal number of propagators irrigated by the large momentum flow. The 
following result holds in a renormalizable (or super-renormalizable) theory with 
no massless particles. When the euclidean momenta (possibly exceptional) are 
scaled by a large factor 1, a Feynman integral Ig is majorized by 


Ig(AP) ~ 12 +8 (8-72) 


where é is a symbolic notation to express the fact that the power behavior 72 
is corrected by an integral power of logarithms and Q stands for 


Q= ae co(g) (8-73) 


Here g runs over all subsets of G such that (1) the reduced diagram G/g has a 
vanishing total external momentum entering each of its vertices and (2) each 
connected part of G/g is attached to some external lines. In short, this means 
that the (large) external momenta may flow through g without irrigating G/g 
(see Fig. 8-13). Finally, w(g) is the ultraviolet superficial degree of divergence of g. 


Py P3 Py P3 


(a) (b) 


Figure 8-13 Possible flows of the large external momenta (heavy lines) through the box diagram, 
in a nonexceptional (a) and in an exceptional (b) configuration. 
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When the theory is strictly renormalizable (no super-renormalizable coupling) 
and when momenta are nonexceptional, it may be shown that the upper bound 
Q is reached by w(G) in agreement with our previous considerations. The reader 
may apply these general rules to the previous example of Fig. 8-12. 

These results may be extended to minkowskian momenta or to configurations 
where only subsets of momenta become large. Bounds on the power of the 
logarithm of A can also be obtained. 


8-4 THE CASE OF QUANTUM ELECTRODYNAMICS 


This section is devoted to the specific problems of quantum electrodynamics, 
namely, those related to gauge invariance and Ward identities. All these matters 
have already been encountered and treated within the one-loop approximation 
in Sec. 7-1-4 and our purpose here is to carry out the analysis to all orders, in 
a way consistent with renormalization. We shall first derive the Ward identities 
in a more systematic and algebraic way than in Chap. 7. 


8-4-1 Formal Derivation of the Ward-Takahashi Identities 


We start from the lagrangian (6-25) with a photon mass yw. Through Noether’s 
theorem the conservation of the current 


jAx) =e: W(x)p W(x): 4, (x) = 0 (8-74) 


was a consequence of the invariance of the lagrangian under the global phase 
transformations 
W(x) e w(x) x constant 


This property implies relations between Green functions involving a single 
current operator and an arbitrary number of fields y, w, A. 

Our program is to derive first a set of covariant identities as a consequence 
of current conservation and the fact that Green’s functions may be expressed as 
time-ordered products in Minkowski space. We then try to maintain these rela- 
tions through renormalization. The expressions we obtain in this way have a 
Lorentz covariant form and it is understood that we work with covariant T 
products (see Sec. 5-1-7). 

If the caret above a term means its omission, we have 


82 <O| Tj,(x)W(x1)W(y1) Wn) Ap (21) Ap,(Zp) |0> 


= y £0] T {joo Wx] 5(x° — xP (vd) + WOa)L Jol), HOI] OC? — yP)} 


il 
an 


i i 
x W(x) Wey) *** Ap, (Zp) 0 


$Y 0] TWO) Wn A p21) Lol, A (2) 5(x° — 29)--- Ap,(20)|0> 
ne (8-75) 
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The term containing 0,j’ has dropped out, while the remaining terms just come 
from the x° dependence implicit in the T product. We now appeal to the canonical 
commutation rules: 


[jo(x), W(x] 6(x° — x) = — eW(x)d*(x — x’) 
[jo(x), W(x’)]d(x° — x’®) = e)(x)d*(x — x’) (8-76) 
[jo(x), Ap(x’)] 5(x° =x )=0 , 


which express that W, w, and A create quanta of electric charge Q = f jo(x, t) d?x 
equal to —e, e, and zero respectively. In contrast with the explicit form of the 
current, which follows from the minimal coupling prescription, the previous 
commutation rules, or at least their integrated versions, are crucial for the con- 
servation of the charge. The Ward-Takahashi identity (8-75) reads 


8 <O| Ti x)W(xs) (v1) "+ Ap,(Zp)|0> 


= e<0| TWxs)W(y1)"* Ap,(2p) [0 x [5%(x — yi) — 64x — x] 
(8-77) 
We shall investigate further the following cases: 
cs p— 0 self-energy and vertex 
n= | vacuum polarization 
at, ee, photon-photon scattering 


We remark that (8-77) holds also for the connected part of Green functions. 
The forthcoming discussion is formal, as we sidestep ultraviolet divergences. It 
will be justified in the next subsection, where we exhibit a regularization that 
preserves the identities. 


1. Let G,, be the complete photon propagator and G!?! the free one (Fig. 8-14): 
Gute Ge) [ate Gio (x — x') O| Tje{x')Aa(x)|0> (8-78) 


It follows from Eq. (8-77) that 
Os GinaX) a OfGi x) (8-79) 


Interpreted in momentum space, this is seen to imply the transversity of the 


Figure 8-14 The vacuum polarization in quantum electrodynamics. 
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vacuum polarization 


kB pq(k) = 0 
é z (8-80) 
@palk) = —i(Ypok? — k pk.-)@(k?) 
which generalizes the result derived in Eq. (7-6). 
Indeed, in momentum space, Eq. (8-79) reads 
; M? 
KG pak) = kPGiel(k) =—! 12(k? — M2) 
where M? = p:7/A or, after multiplication by Gz) = —iT.,, 
M? 
k= Te —M> kT, o(k) (8-81) 
If we parametrize I,, in the form 
Ta(k) = —[gpek? — kpko(l — A] + Gpo lt? + Dpelk) 
= A(k?)(gpek? — kpke) + B(k?)k ke (8-82) 
the identity (8-81) tells us that 
k2 ea M2 
B(k?) =A mB 


In other words, B(k*) is not affected by radiative correction and @,, is transverse. 


2. The relation between the electron self-energy and the vertex functions is 
obtained by considering the complete (not necessarily proper but certainly 
connected) vertex function ¥,(p’, p) defined as (Fig. 8-15) 


—ie(2n)*5*(p' — p — q)¥,(p’, Dp) 


[a d*x, d*y, ell’ * *1—P* ¥1— 4° *) <0| TA,(x)W(x1)W(y1)|0> 


ll 


—iG!2(q) [ats d*x, dty, elfP 1-1-4) (Ol T77(x)W(x1)W(V1) [0D 
(8-83) 


Siceno -—p—g)%, = 


Dp D 


Figure 8-15 The vertex function and its decomposition into a proper vertex dressed with complete 
propagators. 
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In terms of the vertex function A ,(p’, p) already encountered ia Eq. (7-46) 
[A'(p', p) = y,] and of the complete electron propagator 1S(p) (ek 
m) *], we have (compare Fig. 8-15) 


V Ab’, P) = Gop (a) [iS(p')A? (v’, PIS(P)] (8-84) 
Contracting Eq. (8-83) with k, and using Eqs. (8-84) and (8-79), we get 
e(2n)*5*(p' — p — 9)q’Goo(a)S(p')A*(p’, p)S(p) 
= —q’Gpo(4) [ax dx, dtyy et? =P 1 4 CONTY4(xpyeloculy1) 02 
(8-85) 
Since q?G'\(q) is proportional to q’, it follows that 
e(2n)*6*(p’ — p — 4)S(p')q’A,(p’, p)S(P) 
=a [ass d*x, d*ty, ef? 17? 91- 4 =a Tj (x)W(x1)W(y1) |0> 
We may now use the general identity (8-77) for n = 1, p= 0: 
e(2n)*5%(p' — p — q)S(p')a° ALP’, p)S(p) 
= ie [atx d*x, dty, ef?’ P*¥1—4°) Ol Tyy(x1)W(y1)|0> 


le =F) oe) 


Therefore, 
S(p')q?A,(p’, p)S(p) = S(p) — S(p') (8-86) 
q’A,(p', pP) = S"*(p') — S-*() = [F — m—- SP) — P-m—-D(0)] 
(8-87) 
Differentiating with respect to p’? at q = 0 yields 
Ad. P)= 35 S-*0) (8-88) 


in agreement with Eq. (7-475). 


3. Finally, the Ward identity for the photon-photon scattering amplitude enables 
us to factorize four powers of the external momenta and hence to improve 
the power counting. From Eq. (8-77) for n = 0, p = 3, we learn that the four- 
photon Green function satisfies 


kP'T 5, prpspa(K1> Ka, k3,k4)=0 ky +k, +k3+k,=0 
and similar transversity conditions with respect to k>, k3, k4. It follows that 


Toppan ko, k3, ka) az Kili opment P2, P3» pa) 
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where I’; is antisymmetric in the first pair of indices o, p;. This factorization 
of photon momenta may be pursued for k2, k3, k4, without introducing singu- 
larities. We end up with a four-point function whose effective superficial degree 
of divergence is minus four instead of zero. 


It remains to show that these identities are preserved by the regularization 
and renormalization operations. 


8-4-2 Pauli-Villars Regularization to All Orders 


We choose to work here with the traditional Pauli-Villars regularization, rather 
than the dimensional regularization. The latter will, however, be illustrated in 
the two-loop computation of Sec. 8-4-4. We recall that the Pauli-Villars method 
amounts to regularizing the fermion loops and photon propagators independently. 
The photon propagator is replaced in a standard fashion by a superposition of 
free propagators. Fermions loops, however, are treated as a whole, each one 
standing now for a sum of contributions corresponding to fermions of different 
masses, minimally coupled to the electromagnetic current (Fig. 8-16). Explicitly, 


I 1 
= om OF | 2 Qn! {ts Mage g hm M, + i Ym) (8-89) 


with the convention that Co = 1, Mo = m, and q,°**q2, are the momenta entering 
the loop. We rationalize the denominators, compute the trace, and expand both 
the numerators and the denominators in powers ofA}. We find 


oe 7 | d*p — P,(p?, p* is 47) + Me Pu-1(p”, B° qi, GP) + 
(2x)* Q>2,(p”, jg is q?) ae M2Qon-1(p”, p° Gi» q?) aan 


F ie, en. Prer-+ | 8-90 
-yc | ae as pee ( Qrn O3n - = 


where P, and Q, are polynomials in p of degree less or equal to 2k. For large 
|p|, the coefficient of M;* behaves as |p| *"~**. Therefore, if we impose two 
conditions 


S Ss 
Maer 0 Y C;MP=0 (8-91) 
s=0 


Figure 8-16 A fermion loop. 
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the integrand of any fermion loop behaves as |p|” ° (we discard vacuum diagrams; 
hence n > 1) and is thus superficially convergent. Such conditions may be realized 
through the introduction of only two auxiliary masses 
M? =m? + 2A? 
M3 = m? = NG 


where A? is the cutoff (which will ultimately go to infinity). This choice is such 
that 


(and MZ =m’) (8-92) 


. a ee: 
Ma es Cope ieee (8-93) 


a M2 — M2 ve ~ M2 — M2 


The reason for insisting that C; and C, be integers will soon become clear. As 
for the photon propagator, a single subtraction 


Pe K oko/ 2 
Gpa.reg(k) = | - i(& ee oli age La [u? > ut] (8-94) 


will make it smooth enough to render all diagrams convergent. It is understood 
that uj > co as A? > oo. All the previous prescriptions follow from a regularized 
lagrangian 
i] 1 Bayes ; 
ap alert -A,)(C + pw? + u2)(0°A® — 0°A%) + — A 
= 


reg — 


a 


a : 6+ ACAD + p? + 13) A+ + Y Uilid — 0d — Mis (8-95) 


After integration by parts, the quadratic part in A reads 


1 
LO OTR 
A 2(u3 u?) p (1 ) (1 i) 


= $A,K?(7) [K(u7) = K(u le res aa (uj) Ay 
where the differential operator K ,,(u7) is defined as 
K po(17) = GoA as iT) — 0,0,(1 — A) 


It is then clear that the A propagator, ie., the inverse of the quadratic form 
appearing in La, is i[K~ '(u?) — K~'(ut)],¢. On the other hand, we have intro- 
duced in (8-95) three auxiliary fields (> = W), minimally coupled to the electro- 
magnetic field and endowed with the masses M,, Mz = Msg satisfying (8-92). 
Moreover, , is considered as an ordinary Fermi field, whereas w2 and wW3 are 
quantized according to the Bose statistics! The effect of these strange rules is, 
of course, to reproduce the prescription (8-93). Because of the degeneracy between 
W2 and wW3 and of the absence of the minus sign in their closed loops, C2 = —2. 

We have reached our aim. The theory has been regularized in a satisfactory 
way, since L,.. of Eq. (8-95) is gauge invariant (up to the photon mass term 
and the gauge-fixing transverse terms in 0: A) and the Ward identities of the 
previous paragraph are clearly satisfied. 
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8-4-3 Renormalization 


We now have to show that renormalization may be carried out while preserving 
the Ward identities provided we choose suitable normalization conditions. We 
require that 


DE) | p=m = Sz1(p)| pom = 0 (8-96a) 
OV R () 
— 1 - 
aP vem (8-96b) 
O5(D/ Pigeon = y° (8-96) 
TR (k) = —{(k2g°? — kk) [1 + @e(k?)] — g?@u? + AkPk (8-96) 
OR(0) = 0 (8-96e) 


These conditions are obviously satisfied to lowest order. Equations (8-96b, c) are 
in agreement with the identity (8-87), while Eqs. (8-96d, e) incorporate the infor- 
mation derived from (8-79) and (8-82). As for the conditions (8-96a, b), they define 
the physical mass of the electron, since they guarantee that the complete propagator 
S has a pole of residue one at p = m. Similarly, condition (8-96c) will lead to a 
physical definition of the charge as the coupling between an on-shell fermion 
and a photon of vanishing momentum. However, as already noticed in Chap. 7, 
the conditions (8-966, c,e) may no longer be maintained as the photon mass py 
goes to zero. Indeed, the derivative of the self-energy on the mass shell 
(0/0) > (p)| p=m, and consequently the vertex function, are plagued with infrared 
divergences as y* approaches zero. Therefore, if we insist on taking the limit 
yu — 0, other normalization conditions must be chosen. To remain on the safe side 
we keep here si? small but finite. 

The proof that Ward identities are preserved by renormalization proceeds by 
induction. We assume that they hold up to a given order h”. In other words, 
we have determined to this order the bare quantities Z;, Z2, Z3, Mo, us, and Ag 
in such a way that the lagrangian 


me 
£1 = — BHA’ — ANG, — 0,4,) — 2 6"4,) 
2 
ib, woe “= 
-- — A? + Z2W(id = Mo)W ar eZ way 
y 2 
= — 5 (04S — 0°AB)(OyAoy ~ 8,Aoy) — 52 (0 Ao)? + 5 AB 
+ Wolid — mo — eo Ao) Wo (8-97) 


leads to renormalized Green functions satisfying (8-96). We remind the reader 
that 
Z4e 


Ap=Z37A Wo = Z3!2W ~— and OF F732 


(8-98) 
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Moreover, we assume that, to this order, the Ward identities imply that 
Z3p3 = we Z340 =A and Zi = Z2 (8-99) 


To the next order h“*!, the Green functions are still divergent. We introduce a 
gauge-invariant regularization such as the one exhibited in Sec. 8-4-2 to regularize 
the theory and compute the Green functions using #'"), that is, taking account 
of all lower-order counterterms. Because of the structure of #'")) we observe 
that these regularized functions P'!4* 1) satisfy the identities derived in Sec. 8-4-1. 
Using these identities and the normalization conditions (8-96), it is straightforward 
to verify that the new counterterms needed to order L + 1 have the same structure 
as in lower orders [namely, that no (A”)? counterterm is necessary] and that they 
still enjoy (8-99). 


For instance 


a a 
VA a a [roe 1(p) = poe py] yem Pe | ENE “GD mn 


op reg 
and Zier iy = TAY (p, p) — ALS, p)] y=m = — Ales (enema 
Finally, 

Zyet — Zier 


follows from Eq. (8-88). 


In the renormalized theory we end up with functions related to the bare 
regularized ones through 


EA ae Pi>- ie sD ne Pn; q1,- ==> Ops m, B, é, A) 
= lim Ze Zee (pi, -++s Zp, Mo, Ho, £0; dos A) (8-100) 


Ao 
These renormalized Green functions satisfy the Ward identities as a trivial con- 
sequence of this multiplicative character of renormalization. 

It is important to emphasize the role of the identity Z, = Z, in the charge 
renormalization @9 = eZ3'!*. If several species of charged particles (electron, 
muon, etc.) are coupled to the electromagnetic field, the identities Z? = Z, 
zZ™ — zZ._.., guarantee that renormalization is universal. The concept of charge 
universality only makes sense owing to this identity. 


It would be more appropriate to say that the ratio of renormalized to bare charge is independent 
of the type of charged particle, since in this restricted framework there is no natural explanation 
for charge quantization. This is not the case for certain unified models of weak and electromagnetic 
interactions, where the electromagnetic gauge invariance corresponds to a subgroup of a larger simple 
group of invariance (see Chap. 11). 


In summary the method described in this section is to express the consequences 
of a symmetry in terms of Ward identities and to verify their consistency with 
renormalization. It will again be encountered in different contexts in the following: 
chiral symmetry, nonabelian gauge symmetries, etc. 
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8-4-4 Two-Loop Vacuum Polarization 


The computation of the two-loop vacuum polarization will be presented in mass- 
less euclidean quantum electrodynamics (4 = m= 0), using dimensional regu- 
larization. 


It is rather instructive in several respects: 


(a) It provides an example of features specific to higher-order corrections, ie., not visible at the 
one-loop order. 

(b) It shows how dimensional regularization may be applied in a case involving spinors. 

(c) It is convincing evidence that the renormalization program does work, even when overlapping 
divergences are present. 

(a) It illustrates the statements concerning massless theories and asymptotic behavior. The zero-mass 
computation may also be regarded as yielding the asymptotic (large-k) behavior of the vacuum 
polarization @(k?) in massive quantum electrodynamics. 

(e) It serves as a test of the general results derived in the previous subsections, namely, the trans- 
versity of the vacuum polarization tensor. 

(f) Finally, it exhibits an interesting property of the vacuum polarization, namely, unexpected can- 
cellations at large momentum. We shall elaborate on this point at the end of the calculation. 


Since we are working in the euclidean version of the theory, with a dimensional regularization, 
let us first list some useful formulas. As in Eq. (8-11), the antihermitian matrices satisfy 


{Yus Ww} = —25 py (8-101) 
and, by convention, we choose [compare with (8-11c)] 
tril=4 (8-102) 


All standard identities for contractions and traces of y matrices then follow (d is the euclidean 
space dimension): 


Yep = —4 
fi a —aq’ 
YeYnVo = (d — 2)7p 
YoPuYv¥p = —(4 — A)ypyv + 40 
VoYuv¥aYp = (6 — d)PoYvVu — 2(d — 4)(GuvPo — Suey + Sven) (8-103) 
tr PuYy = —46yy 
tl Yue? pVo = 4(Ouvd pa — SypSve + Syodvp) 
tl YuPv¥p¥ePrYv = —4(Opv5pc5 — perm) 


In euclidean space, the action of massless quantum electrodynamics reads 
= 1 A 
k= [ax Ec —eA)w+ a (0, Ay — 0,A,)? + 5 4s | (8-104) 


In the Feynman gauge, / = 1, the perturbative rules derived from e/© read 


j (5) (P) ba 
Fermion propagator o—_>—o . Se 
Ba 


Photon propagator _ o~-p>-~~e 
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Vertex (7a) pa 


04 B 


and, of course, a minus sign for each fermion loop. When working in dimension d # 4, the charge e 
acquires a dimension (4 - d)/2, as ulready noticed in Sec. §-1-2. If 4 is an arbitrary mass scale, we 
write 


@ = pt M2¢! (8-105) 


where e’ is dimensionless. We shall use this expression whenever we expand our results near d = 4, 
in order to recover correct homogeneity properties. In this way the unavoidable mass scale creeps 
into the massless theory. 

As a warming up, we first recalculate the one-loop vacuum polarization, fermion self-energy. 
and vertex in this formalism. 

The one-loop self-energy of Fig. 7-5 is 


d’p_ tt Prob + Bro] 
(2x p*(p +k)? 


dd 


1 2)d/2-2 
= —e7 tr (Yu? p¥v¥o) (4m a(5. ‘) (k*) 


av Oxy d 
x | ~kor (2 -$)+2er(t -§)| 
2 2 “Pe 


The integral was performed using Eq. (8-9), where m7 = m3 = 0, in terms of Euler's B function 


T,o(k) = —e? | 


; ‘% ) 


Defining e = 4 — d we are led to 


Be? /d d d\ (k?\4!2-2 
Tl) = — Ga a(S. s\r (2 = ‘) (5) (Spck? — kpke) 


a [2 k? - 
~——|--In eo + constant + O(é) |(d,.k* — k,k,) (8-106) 


fe 3nLe 
in agreement with Eq. (7-9), if we identify 2/e with In(A?/m?). With the conventions of Chap. 6, the 


euclidean proper two-point function is the opposite of the inverse propagator. Therefore, after intro- 
duction of the counterterm 


Zi 
(Z3:—"1) = 


the renormalized vacuum polarization reads, in four dimensions, 
Doe = —(pok* — kpkg)@"*\(k?) 
2 (8-107) 


a ot k 
@"1(k?) ey (in — + constant 
3x B 


The constant is an uninteresting combination of x, y (Euler constant), etc. Notice that we have used a 
new type of normalization conditions. Instead of fixing the value of @(k?) at some point, we have 
decided to subtract from (8-106) the pole term only. We shall adhere to this convenient prescription, 
referred to as the minimal renormalization. Such a procedure satisfies automatically the Ward 
identities. The other two diagrams of Fig. 7-7 and 7-10 are computed in the same way. The fermion 
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self-energy is 


ae | atk Yoko 
(22)* (p — k)?k? 


d*k k 
ola = pS eS 
nv an. | Qn)! (p— kre 


ead — ee ; ( 5) (5 ‘) 


rp) 


i 


“ae iC + finite terms . (8-108) 


ein (4x 


Hence it requires a wave-function renormalization equal to 


74 
2 
Cee a = (8-109) 
As for the vertex function it reads 
ak = yAK + dyuky, 
ir =e ee 
PQ, 4) |e (p — Kk + gee 
k + qak 
= €7[(6 — dypyuya — 2(d — 4) Sane — Sepyn + Supe (a ( p 
_ ay us — Sap?n + Oupya)] On @-b*k+ oe 
a a i . Yu + finite terms 
which leads to the counterterm 
e 2 
(Z; —1U= —- Gay? ; (8-110) 


We verify that the identity Z, = Z, is satisfied to this order, and that Z,, Z2, Z3 are the same as 
in the massive minkowskian theory with the identification 2/e = In (A?/m?). This computation was 
clearly unnecessary! 

We now turn to the two-loop diagrams of Fig. 8-17a,,a2,b. The insertion of the counterterms 


: p+q 
p Dp 
k k k k 
Di tak 7) oP Ie oP G) 


ptk 
(a;) (a>) (b) 
(a') (2) (b;) (b>) 


Figure 8-17 Two-loop contributions to the vacuum polarization. The contributions of order }{ counter- 
terms are also depicted. 
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of order h is also depicted. The two diagrams a, and a2 obviously give equal contributions. We 
first compute P') = 14” + [7% in terms of the fermion self-energy Tp) of Eq. (8-108): 


d'p 1 1 
Tk) = —2e? | | — [2\(p) - | 
ay) ome lyse pe 
2e*(d — 2) ( ‘) “| | dp Lp + B70] 
= mie B = jee 3- a2 
(4n)4/? Z 2 2) | (any! (p + k2(p?) 
The integral over p is performed using the identity 


aan aay |, dx x2 742 eo 3F (8-111) 
(p?)? 42, F(3 —d/2) Jo 


and the parametric machinery. This yields 


dé (p + k)upv a 1 i a(5.4-2)aer 
(27)? (p+ k)? @yae (4x)? T(3 — dz) 24 


x [—kukyT(4 — d) + $6yk2T(3 — d)] 


and after some algebra 
8e*(d — 2) B(d/2 — 1, d/2)B(d/2, d — 2)( ey - 
(4x 2-d/2 
x [—2k,.k,T(4 — d) + 6,.k?T(3 — d)(1 — d/2) + k?5,.1 (4 — d)] (8-112) 


Tya(k) = 


Let us expand this expression near d = 4, T(4 — d) = lye — 7 + Ole), (3 — d) = —1'e + (7 — 1), ete. 
We find 


a? 1 k? ‘) 1 Sika Ke :) 
(GQ) (7 \ eee Le = 2 ] poet pal ta omer 
T2(k) 32 fkske| 2 +i (y+in ae 5 in a se r+ - 
eal ie 1 i k? 
bok? | +2 (+i = 2)— Fin Siwy t+ } (8-113) 


where constant terms have been omitted. The contribution I\2’ of the counterterms (Fig. 8-17a4, a5) 
is also readily computed 


rind = -2e4 | Ear | Rare =|5y : 
(2n% (p+) ” pL(4z) p” 
= -52a(6. ‘) (k2\!2-2(K2 5 yy — kk) ( -5) 


a 2 2 2 
os ee ie’ ee y 5 p 
Rae (k On kak 15 ml oe 3 5 + qin ae 2 6 ae 


(8-114) 


I 


Three remarks are in order. First, the contribution of the counterterms is transverse—this was obvious 
since it is essentially the one-loop diagram computed previously —whereas I’ is not. Only the sum 
of all contributions of Fig. 8-17 will be transverse. Second, we see that in the sum [™ +P the 
divergent terms do not cancel. This was expected since the diagram still requires an overall subtraction. 
At first sight, it seems surprising that the dominant terms 1/e* (or In? A?) do not cancel, since the 
internal divergences have been subtracted by the counterterms. But the renormalized fermion self- 
energy behaves as In p? at large p*; hence its insertion in a superficially divergent diagram gives rise 
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to a In? A? divergence 


A 
| Apa hr ~ In? Ne 


The coefficient of the In(k?/:7) term depends on the normalization condition for the self-energy 
diagram. As is seen from Eq. (8-114), changing the counterterm (Z, — 1)!!! by a finite amount results 
in a modification of the In (k?/,:7) term (and of the neglected constant term). However, this dependence 
will disappear in the complete expression of I,,. This illustrates the fact that the renormalization 
group equation (8-59) is only satisfied by a Green function to a given order, but not by individual 
diagrams. 

We proceed to the much more cumbersome computation of Fig. 8-17b. With the notations of 
Fig. 8-17, the amplitude reads 


d‘q 
Pi2(k) = —e4 [sat tr (YvYeYpYaVa p¥6Y a) Lapuv(k, 4) (8-115) 


where 


i. | d’p_polp + khalp + k + alalp + Oy 
appv (2x)4 p?(p + k)*(p fan + q)* 


The p integration will be first carried out. We introduce Feynman parameters «,...,4 for the lines 
of momentum p, p+ q,p +q+k,p+k respectively. We get 


lope = | da, daz daz da, Japuy 
0 


Japuy = | es palo + k)(p + k + q)u(p + qv (8-116) 


x EXp — (a23q7 + as4k? + 2ask-q + Dp? + 2a23q°p + 2asak p) 


where we have used the shorthand notations a3 = «2 + 43, etc., Dy = a, + % + 03 + ag. It is con- 
venient to shift the integration variable 


a a 
p=p+—qt—k=pt+O 


2, a 


and to rewrite the numerator of (8-116) as 


pa(p + Walp + k + gulp + av = (p’ — Dalp’ + k — Qalp’ + k +g — Q)dp' + 9 — Dv 
We then have to deal with integrals of the form 


dép’ eee ol 
(= (nye ™P(- ae (ny)? 


dép 1 
as ye (8-117) 
IS (2n (nye PP + Ree 2) (4ny)*? 


a , Ure Te ! t 1 
(|< PuPvPaPp &XP (ig 7) = ay2(4ny)"? (SyvOap + Onadvp + 5p va) 


while the integrals involving odd powers of p’ vanish. We also need the trace of y matrices. From 
(8-103), we find 


tr (YvYernPaVey BY) 
= —4(d — 2)[Sap(5veSup — SvySop + Svpdau) + 5p (SvaOpa — SvpOex + Sva5en) 
= 5ap(Sve5 up a 5 v5 pe an 5ypSen)] 
—4(6 — d)[5vp(Sea5up — Spe5pa) + Sval5pe5up — Sap5up) + Svo(SabSue — Foa5up)] (8-118) 
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The first bracket is symmetric under the interchange of «and f, whereas the second one is antisymmetric. 
A tedious but straightforward algebra yields 


2 + wy 2c3gk? + 2(a,03 — a2a4)q°k 
(4x)°)4/? exp ee 
Z » 
= —4(d — 2){Q,0Ak? — 24?) + Qogel —k-(k + 2q) + 4q°O) + Qpkal(k + 2q)°q — 2k-Q] 
+ kpQeQ-(k — 2g) — kpkaQ-(q — Q) + 2404eQ°(k — Q) — Splk + @ — a — 00° (k— Q); 


| Japuy tt (YvY¥oVuPaVay oVBYa) 


+ 46 — d){(5 jok® — kpka)O“(Q — 4) — pak Q — Oks) [Sick -(Q — 9) ~ klQ — Q)e)} 
— (2 — d)(24pda — kpke) + (d — 2)5pel(k + q— Q)°(q— Q) - O-(k — QD] 
+ 25 ye(k — 2Q)-(k + 2g — 2Q)} 
a {[k? Spe — kpkel(d — 2)} - = [i ped(d + 2)] 
= —4(d — 2)A, +°4(6 — dA, — = B, + = ie ra Cy (8-119) 


where the origin and the denomination of each term is clear and we have used the symmetry between 
p and o. 
The q integration must now be carried out. We write 


1 Zz 
= H23014 {, dx exp (- X0230%14q ) (8-120) 
0 


a) a 


As the integrand involves 
exp| (x + 1)o23014q7 + a12%34k? + 2era3 — aed 


s 


times powers of g, we perform a new shift of integration variable 


=q — ak (8-121) 
X13 — A2h,4 
where e = ———___—__ 
(x + larga23 
Accordingly, 
Q = G23q' + zk (8-122) 
With 434 Hy A3 — Aghy z=) X23 


Sy are Bes 


Also, we call y the coefficient of qg? in the exponential 


023014 


» 


23014 


x 


y=(x+1) 


1 
eae (8-123) 


Therefore, the desired amplitude (2(k) reads 


(b) eile eee 1 023014 
Toa(k) = —i¢. days Sool 2 
0 @ 30 (4x5°"/2 y 


012023034041 — x'(a14%3 — O20t4)" 
x exp| - Miaastaater — x'(aadts — ort)” 12 


YVio23014 


RENORMALIZATION 421 


: | oom | -aa—24 ree a c,| (8-124) 


It is then a pure matter of patience to compute 


d*q 2 23%14 + 2(a1 — %3)(%4 — &2) 
/2 ~ya = = 
(4nyy! (<4 e-"7 A, = kk, —<—<—. 


+ bpe| 2 — z)(a+z)(l1—a—z) 


2 


7 z pis — &3)[a + 2z — 22(a + z)] + 2(at1 — H3)(o4 — “2 


oo 


kd ; 

- Sh [(~ + z)(l — « — z)&33 + (1 — &3)*2(1 — 2)] 
d(d +2). b 

ap = a, f= ana? | 


(4ny¥!? | gies, apes d)(202 = Dk ke 
—(d— 26,04 + k?[(a+z)(l-—a—z)+2(1-—2)] - > [a3 + (1 — anarh 


d 
+ 289| KU — 2z)(1 — 2z — 2a) — 2&23(1 — a3) ‘] 


rr 


dé tg d—-2. 
(4nyy? | eat ge ‘A, = &23(1 — h23)(k ke = k?5 pa) 


(2n)4 2y 
Consequently, 

Ayr eo L dx!’ (™ day -+: dogaz3%14(1 — x’) 
ee = (any 4(6 — d)(d — 2)(k?5 po — kpka) |, i | Qy a2? yar2 
12023034041 — X'(o103 — O04)” |» 

x exp | — —————____-___——_ k 

mE DY m23%14 
4 
= — — 26 — d)(d — 2)(k?5p5 — kpko)(k?)*-*T(4 — d) 
(4x)? 
: 1 day ++: dagd(1 — a; — &2 — &3 — 4) 
f sale Lt , 
. |, sal : ”{) (11.4023) 74/7 -* 


x [12023034041 — X'(a1, a3 — Caer) a] cae. 


After the change of variables, 
ope a2=(1-B a3=(1-A(l—v) o%=A(l—y) 
op t23= 1-8 
[or2casesadter — x'(otra3 — at2a4)”] = B(L — B){[BU — u) + (1 — BY — »)] 
x [Bu + (1 — B)v] — x'B(L — B)(u — vf} 
|, da, -**da4d(1 — 01 — %2 — &3 — O14) F(a;) = i) dB Bi — B) i, du I. dv F(a) 


0 0 0 oO 
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it is easy to check that this integral is convergent at d = 4. Therefore, neglecting as before constant 


terms in @(k?): . 
[42+ B) J —In a ae" ) (kpka ~ 5pok7) 
4 é mM 


Similarly, in the C, contribution 


e4(d — 2)d(d + 2)5p0 


psn k? d-37° a= 
ve (nye (k*) (3 —@) 
: =! day -++dag 5(1 — a; — &2 — &3 — OH) 
_ | dx! | Oy 4 Of vm 3 — M4 
fy) iy) (%1 4023) 


d-3 
x [012023034041 =X (e103 — tz a4)? | 


the £ integration is also convergent at d = 4. Hence 


2 1 k2 1 1 1 1 
ro — 5 (2 n+) | av’ | ip | au dv 
Tt € BL 0 0 0 0 


x {[B(1 — u) + (1 — Bd — v)] [Bu + (1 — B)v] — BCL — Bu — 0)} 


ae: 1 k? 
~ —— —6,¢k7{--In-—+4+°°: 
mr 24° (; ro ) 


(8-125) 


(8-126) 


On the contrary, the contributions from A, and B, possess internal divergences at /} = 0 or 1. For 


instance, the terms proportional to k,k, read 
4e* 


(4n)¢ 


1 1 1 
rien ae (d —2)/T(4 _ Dk ak ke | dx’ x/d]2~2 | du | dv 
0 


0 0 
1 
x \, dB [B(1 — B))?~*? {[B — u) + (1 — £1 — v)) 


x [Bu + (1 — B)v] — xB — B)(u — v)?}*-* 


| peu ot (bf) of! — 2) 
tO > 


Ba = B) — x'[uv + (1 — u)(t — v)] 


bax 
+ SS tu — oP a oer 


But the required expansion at d ~ 4 is easily obtained if we observe that, for any function F(/s) regular 


at 0 and 1, 
1 


|, dB [BQ — py F)= | ap [BO °* (1 fy) ee CG! 


0 
Ultimately, we obtain 


2 ee e/a ee 1 ae Ps k? 7 
Peart Bo wy peat (Seren cts oe | a. =a ~ 2 a ai aad 
ae 3aa Kok aur area 4 + |n po Pi 7 + 


oc? bys k? I me k? 3 
===) || =< =v Sin he a | ieee Bae : 
32? E (5 ) na) +5hn ea Z ‘ + | (8-127) 


The total contribution of Fig. 8-17b is therefore 


a? Ta I (oe k2 5 
Pe = lie ky | | = en eh? on pe ene 
p a th Ja+2(3 y nia) + 5a + ns ai | 


ho il ui k? 1 k? k? 11 
<a pa) een ad | a ae aay, Per a <a oe 
Opak E + - (\ y—In =) + 5 In Z + In Z ( ‘i ) + } (8-128) 
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We should still compute the contribution of the vertex counterterms, as depicted in Fig. 8-17b}, b}. 
But owing to the Ward identity, it is easy to see that they cancel exactly the contribution of the 


self-energy counterterms (a), a3). Indeed, the latter are proportional to (Zz! — 1M} = ~(Z, -— 1)8), 
while the former are proportional to (Z, — 1)!"!. This is, of course, particular to quantum electro- 
dynamics. 


Adding the two contributions (8-113) and (8-128), we finally get the result 
Ti2) = (kpke — Spek?) o!2(k2) 


8-129 
sae | ee 


1 dé 


We observe all the desired features: 


1. The divergent terms of the form (1/e) In (k?/u?) which could not be eliminated 
by a local counterterm have cancelled. 

2. The vacuum polarization tensor is transverse, as expected from the Ward 
identity. This holds for both the divergent and the finite parts. The former 
may therefore be renormalized by a transverse counterterm of order h?. For 
large euclidean k’, and up to order a”, the renormalized function @g(k?) is 
then given by 

Ge kee. ie 


—S= — — 3 - 
5q a gu lta + Ole") (8-130) 


Or(k?) SS 
The foolhardy reader may check that the finite terms that we have neglected 
are also transverse, or (even better) that the complete expression of @(k*) is 
independent of the gauge parameter A. 

3. Fortunately, the result (8-130) coincides with the one obtained by other authors! 
We conclude that the dependence on the normalization condition of the self- 
energy has dropped out in the sum + T. This is obvious since the two 
counterterms actually cancel. 

4. Unexpectedly, we find that the divergent terms in 1/e? [which in a conventional 
regularization would read In? (A*/y?)] and, accordingly, the In? (k?/u”) have 
disappeared. This is a general property of the vacuum polarization, valid to 
all orders when we restrict ourselves to diagrams with a single fermion loop. 
Indeed, it is possible to fix order by order the gauge parameter A so as to 
make Z, = Z, equal to one. Consequently, the selected subset of diagrams 
does not exhibit any internal divergence and its contribution to the gauge- 
invariant quantity @(k?) behaves as a single power of In (A?/k”). 


This has led to interesting speculations by Johnson, Willey, and Baker and by 
Adler. Could it possibly be that the coefficient f(«) of the logarithmic behavior 
vanishes for a nonzero value of «? Since the same function multiplies In A? (or 
1/e), this would seem to indicate that a reordering of the perturbation series 
might eliminate ultraviolet divergences for some appropriate value of the bare 
coupling. 
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Berlin, 1969. In the early 1970s, W. Zimmermann gave a comprehensive account 
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CHAPTER 


NINE 
FUNCTIONAL METHODS 


The path integral formalism of Feynman and Kac provides a unified view of 
quantum mechanics, field theory, and statistical models. Starting from the case 
of finitely many degrees of freedom it is generalized to include fermionic systems 
and then extended to infinite systems. The steepest-descent method of integra- 
tion exhibits the close relationship with classical mechanics and allows us to 
recover ordinary perturbation theory. Among various applications, we deal here 
with the concept of effective action, quantization of constrained systems, and 
evaluation of high orders in perturbation theory. 


9-1 PATH INTEGRALS 


The original suggestion of an alternative presentation of quantum mechanical 
amplitudes in terms of path integrals stems from the work of Dirac (1933) and 
was brilliantly elaborated by Feynman in the 1940s. Schwinger developed an 
equivalent approach based on functional differentiation. This work was first 
regarded with some suspicion due to the difficult mathematics required to give 
it a decent status. In the 1970s it has, however, proved to be the most flexible 
tool in suggesting new developments in field theory and therefore deserves a 
thorough presentation. 


9-1-1 The Role of the Classical Action in Quantum Mechanics 


Let us return to quantum mechanics to inquire about the role of the lagrangian 
formalism as opposed to the hamiltonian one. For simplicity let us discuss a 
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system with one degree of freedom described by the pair of conjugate operators 
O and P satisfying 

[O, P] =i (9-1) 


We use capital letters for operators to distinguish them from their classical c- 
number counterparts. Let the hamiltonian be 


EO) = _ + V(Q) (9-2) 


and denote by |a>, |b>,... the states of the system. We are going to seek an 
expression for the transition amplitude 
<b(t’) |a(t)> = <b] eH” Jad (9-3) 


With the usual representation of the commutation rules (9-1) we could introduce 
the square integrable wave functions 


a(q) = <q|a> 
a(p) = <p|a> 


and attempt to solve the Schrédinger partial differential equation arising from 
(9-2). Here the improper states |q> and |p> are such that 


Q|G@>=q|q> P\p>=pl|p> 


1 
<q'|q> = 6(q' - <p'| p> = d(p’ — = + ipa (9.4 
q'|q> = d(q' — q) p\p>=4(p'—p) <q\p>=<plq> > (9-4) 


1@ 
<q|P|p> = p6a\p? = 7 32 <q|p> 


Returning to the transition amplitude, we note that we may use the super- 
position principle to insert a complete set of states at intermediate times. This 
is analogous to the use of the Huyghens’ principle in optics. We split the time 
evolution in infinitesimal steps t > t + At and first evaluate 


<qa(t + At)|qi(t)> = <q2| e7 "¥ |qu> 


The boundary condition requires that when At goes to zero the above 
amplitude reduces to 6(q2 — q:). For small At we expect that the matrix element 
is negligible when q2 differs appreciably from q,, whether its modulus decreases 
or its phase oscillates very rapidly. This suggests that we may substitute for 


the potential operator V(Q) its value V(q,) or V(g2), and amounts to an 
approximation of the type 


e iat. g—iAt(P?/2m) 9 —iatV(Q) 
The terms we have neglected involve the commutators [ P?/2m, V(Q)] multiplying 


higher powers of At. They will be negligible if V has a slow variation in the 
neighborhood of g; and q2. This means that in the short interval At we neglect 
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the transfer of potential into kinetic energy. For the matrix element we obtain 
the estimate: 


<qr(t ote At)| qi(t)> ~ <qo| e7 iAt(P?/2m) e7 tAtV(Q) lqi> 


m e7 '8/2 1/2 [m (q2 = ae 
-( 2nAt exp [5 eo via a) 


We observe the consistency of this procedure. For |q2 — q:| >(hAt/m)!? the 
strong oscillations damp the amplitude so that the correction terms are vanishingly 
At/m)''? «1, where V’ is the derivative of the potential. 
This may be used to obtain a more symmetric form by replacing V(q,) by 
3[V(q1) + V(q2)], for instance. 

Expression (9-5) is suggestive. Indeed, (q2 — q,)/At is the analog of the velocity 
q, and the exponent reduces to iAtL(q, q) where L(q, g) 1s the Lagrange function 


L(q, 4) = 4mq’ — V(q) (9-6) 


More precisely, let g(t’) be the trajectory from q; to q2 during the time interval 
(t,t + At) according to classical mechanics, 1.e., obeying the principle of least 
action. As we have just seen, in the limit At +0, only values of the kernel for 
g2 in the vicinity of q, matter, in a domain of order (At/m)'/?. The phase of 
the transition amplitude is then equivalent to the action 


q2(t + At) 
10,1) = | dt’ L(q,q) (9-7) 
q(t) ' 


evaluated along the classical trajectory from (qi, t) to (q2,t + At). The latter 1s 
not too different from a linear path 


ae i 
a= (14 iG ne Ap? 


— 2 t+ At in 2 
12, 1)~s aE re | dt’ V {g(r} = BHM” _ aeviay (9-8) 


ner 2 \ 12 
and <qalt + At)| qi(t)> = ( i, eillaatt + At).q(] (9-9) 


In the limit At — 0 we check that this reduces to 6(q2 — 41). . 
For a finite time interval the superposition principle allows us to write 


fl nal 
Cap ty| qis i> = in | I] dqp I] Rat Ep+1 \dp» ow 
no i 0 (9-10) 


tp= tit = (ty — t) fo = fj t, = ty 
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Hence 


<q gi, ti> = lim ‘ | ‘la exp | gla dt L(q, q) | (9-11) 
wit Ate = a l i - 
J / - ) | I a 2n(ty = ti) t, 


As a matter of fact the original phase was really the sum 
I=I(n,n—1)+ I(n— 1,n—2)+---+ (0,9) 


that is, the action computed along a broken path as shown on Fig. 9-1. When 
the interval At = (ty — t;)/n shrinks to zero, it is conceivable that we would obtain 
the action along an arbitrary path. To define precisely the set of functions q(t) 
which contribute essentially in this limit is a nontrivial mathematical matter. For 
smooth enough V(q) they are the so-called lipschitzian functions of order 3 in the 
present case, which means that | q(t’) — q(t)| is bounded by a constant times 
|r’ — t|/?. The careful reader is referred to the literature on this point. A physicist 
will, however, proceed without fear and succeed in extracting a surprising amount 
of information from formulas such as (9-11) without the need to investigate in 
more detail the implications of the limit n > oo. 
We shall use the shorthand notation 


apts | dis ti? = {20 exp [ | ‘dt L(q, a| (S12) 
The “measure” on the functional space of trajectories q(t) is denoted Z(q) by 
including the product of normalizing factors(m e~'**/27Ar)! 7, and q(t) is restricted 
by the boundary conditions q(t;) = qi, q(t) = qr. 
A crucial property of the path integral reflects the superposition principle. 
It implies that if t lies in the interval t;, t; we must have 


[20 ele — | aan | 20 eli fn | 20 eiltti) (9-13) 


Q 
x 

I 
_ 


7s 


Figure 9-1 Broken path of integration. 
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If h is reinstated in these expressions the integrand becomes e#!/". The classical 
limit h — 0 involves naturally the evaluation of the path integral by the stationary 
phase method. In this way the classical trajectories correspond to the extrema of 
the action. This presentation of quantum mechanics leads therefore in the most 
straightforward way to the dynamical principle of classical mechanics as a limiting 
case. If the classical trajectory from q; to qy is unique, we expect that up to a 
normalization factor, as h > 0, 


Cf lid > elms 
h-0 


Quantum mechanics can then be interpreted as implying fluctuations around this 
classical trajectory. 


Mathematical subtleties associated with this formalism stem from the fact that the “measure” D(q) 
is complex and the integrand an oscillatory function. This suggests that an euclidean theory obtained 
after a Wick rotation to imaginary time might afford a more manageable mathematical entity. It 
now involves the matrix elements of the operator e ““' and corresponds to a shift from the Schrodinger 
equation to the heat, or diffusion, equation. Indeed, this was the situation considered by Wiener, 
who first discussed path integrals in a mathematical context. To summarize we have now answered 
the question raised in the title of this section, namely, we have recognized that the classical action 
appears in the quantum mechanical amplitudes as an exponent in the weight of trajectories in path 
integrals. 


The formalism may be extended to matrix elements of operators. In this 
way we shall recover Schwinger’s quantum mechanical action principle. Suppose, 
for instance, that we want to compute the matrix element of an operator @ at 
an intermediate time t between t; and t;: 


<ay, ty| O(t)|qi, ti? = <qr| eH —9 © e— HH) |g) 


= {ar dq” | 20 elas.tss 9.0) <q"| 0) |q'> | 20 el (q'st5 gists) 


& 


For simplicity let us assume that © is diagonal in the q representation 
<q"| O|q’> = 0(4')5(q" — 9’) 


The above expression reduces symbolically to 
Caps ts| OW |a, ti = [20 el") O[q(t)] (9-14) 


This may be further generalized to a time-ordered product of operators 


O,(t1)@2(t2)-*° hie 2" 


If they are all diagonal in the q representation we have 


aps tr| O1(t:)O2(t2)-*-|q:, = {2 ao” O,[.q(t:)|O2La(ta)] 15) 
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Using these results we may consider the effect of an infinitesimal change in the 
dynamics between t; and t, (for instance, a slight variation of the potential or of the 
boundary conditions). The corresponding variation of the transition amplitude 
may be expressed as 


bas, tr| qu tir =i {2 eS) SIF, i) 


This has the form of (9-14) and (9-15) with an operator which we denote ol, 
depending, if the case may be, on the intermediate times, so that we can write 
in general 


d<ay, tr | Qi tid = iXgy, ty| OI qi, te? (9-16) 


This expression may be used as an infinitesimal substitute to functional integrals. 
For instance, if the variation is a result of an increase in the final time from 
tr to ty + dty for which 6] = — Hot, formula (9-16) reproduces Schrodinger’s 
equation in the form 


5<qs; ty| gi, ti> = —i<qs, tr| H(ty)|qi, ti Oty 


Let us check Eq. (9-11) for a simple harmonic oscillator 


OO ae a (9-17) 


of frequency w and energy levels E, = (n + 4)a. 
We apply (9-11) as it stands. On any short interval (r,t + Ar) we approximate an arbitrary 
path by a linear one in such a way that the corresponding increment in the action is 


AI(2,1) = m[@Q2-ai) _ ve q3 + 4241 + qi 
At 3 
Set t = ty — t;. We find 


i? =n {T I] da,( exp {2 


noo 


HM s 


n OTs ' 
+ Ge — Ip-1) — Gy (4p + Gpdp-1 +.497-1) 


(9-18) 


We have to integrate the exponential of a quadratic form e‘"/?4”@ where q stands for the set 
Go = Gis 15+++) Qn—15 In = Qe and M is such that 


ie (ert 
Maser = Mass =—(24+24) l<k<n-1 
t  6n 


All other matrix elements are zero. The integration variables are Gis--->Qn-1- If N is the matrix 
obtained by deleting from M the rows and columns with indices 0 and n, we have 
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el 


9Mq = Moo(q6 + ga) + 2Moi(Gog: + Gn-14n) + SY QyNuq 
jl=1 
Using the classical method to compute gaussian integrals, it follows that 


(m—1)/2 
[ant ellmi2)qNq — (= ent) a (det N)~ 1/2 
m 


Linear terms in the exponential are taken into account by a translation, with the result that 


—in[2\n/2 O} in/2\ (n—1)/2 
<s| = tim (™E) (7 ) (det y= 
2nt m 


no 


im 
x exp iz [Mool(gé + 92) — M3(Nis'gd + Nai n—192 + 2Ni3-saoad} 


Defining a to be equal to 


_ 11+ @7t?/6n? 


2 1 — wt?/3n? 


the determinant of N is given by 


eee a a alate 
det = 20-1(2— 5 det,-1 4 | 1 -a 


The remaining determinant will be denoted /,_ ,(a). For fixed a it satisfies the recursion relation 


I,(a) = I,-3(a) — a7I,-2(a) oa) = I1(a) = 1 


(ya) (0) i) 
I-i/ \l oO 1 


After diagonalization of the 2 x 2 matrix we find 


This is solved in the form 


A (a) — AX Dodi / 4a 1 
& eater), foe es, [oat 
A.(a) — a-(a) 2 se 
For large n we thus have 
n sin wt 


I,-1(a) seh ng 


Consequently, 


mo —\" 


css ( 


im a 
= lim exp iz [(q? + q3)(Moo = M2,N7}) = 2q:9;MoiNia- n 
27 sin wt 2 


no 


When sin wt < 0 a careful calculation allows one to choose the correct phase. Let us leave this point 
aside by assuming wt < z. To conclude, we have to evaluate 


t I,-2(a) ¢t ( ot ) a 
ee ———————— ~-—{1—— cot wot }+ O(n”) 
Nia 2n(1 — w?t?/3n?) I,-3(a) 0 n ( 


t Cie ot? 
7 i —— Ea ee O an 3 
Niw=1 2n(1 — w*t?/3n?) I,-1(a) _n? sin wt wr) 
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The final expression 


(f|d= (ee exp {me a + q?) cot wt — 245% Zara. |} (9-19) 


2n sin wt sin wt 


coincides, of course, with the result obtained through more traditional means. It is interesting to 
note that in this special case the coefficient of the exponential is equal to the value of the action 
for the actual classical trajectory over the finite time interval. This is a particular feature of quadratic 
hamiltonians. 

As a second exercise the reader may apply the same method to a time-dependent hamiltonian 
describing the one-dimensional motion of a particle submitted to an external force: 


Pp? 
H = — — QF(t) (9-20) 
2m 
The corresponding amplitude is 
m e~i#? 1" a 
ee esos (9-21) 
al iDe Ee aan é 


where again I f, i) stands for the action evaluated along the classical trajectory 


\2 t, aa = 
I(f, i) = Foe | at Fay + ae) a ) 


ty—t tr —t 


+ x | ir dt’ dt” req| Pe 2 a ti) — Inf (t’ — t,t” — are) (9-22) 


The expression G(?’, t’) = ¢'t’/T — Inf (1’, t”) is the symmetric Green function of the classical problem 
q = F(t)/m already encountered in Chap. 1, satisfying the boundary conditions G{0, rt”) = G(T, t”) = 0. 
The analog of Eq. (9-16) yields 


ary ID Cf|Q() |r (9-23) 


This result enables us to give an algebraic definition of the transition amplitude for an arbitrary 
hamiltonian of the form (9-2). In this case we may write 


q? ty rs) 
| 20 exp iF dt E = via} = exp \- i \ dt V lara F 


This formal expression becomes operational if we expand the exponential operator into a power 
series and generate the perturbation theory. 

All the previous considerations can be extended to finitely many degrees of freedom without 
difficulty. 

Let us now mention an apparently more general derivation of the evolution kernel (9-12). The 
idea is to use both the |q> and \p> bases of the Hilbert space. Let us indeed assume that the mixed 
matrix element of the hamiltonian may be given the form 


(9-24) 


F=0 


il . 
<p| 1 |q> = h(p, q) —= eva (9-25) 
,/2n 
The “classical” value h(p, q) will be equal to the quantum operator H ordered with the P to the left 


of the Q after substitution of c numbers to operators. This will not present any difficulty for hamiltonians 
of the form (9-2). For an infinitesimal time interval Ar we obtain from (9-25) the regular approximation 


{pl e~ 8 |g) 2 —— ira —idchtp.) 
2n 
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Therefore if we write 


n a-t 
<qy.ty| qu ti> = lim | [hedpw [ederXaulips><paliem*"" |an— 1 >---<qs | pi><pi e~ **" |qo> 
1 1 


no 


, Where as usual qo = qi, qn = qs, t = ty — ti, we obtain 


jilted i dq, 


ay tr| qi to = = lim ny 


no 


x exp {frat — Qn-1) + °** + Pili — Qo) — : has Gn-1) — °° — . h(p4, ao 


= | 20. q) exp i \" at [pq om h(p, a} (9-26) 


The last expression is of course symbolic. Observe that we integrate over one more variable p in such a 


way that the boundary conditions only involve q; and q;. We recognize in the integrand the classical 
action 


Kf) = | ‘dt [pd — hip, a)] (9-27) 


expressed in terms of the canonical variables p and gq. If h(p,q) depends on p quadratically, as is 
usual, the integral over p reduces to a gaussian. If we perform the latter, Eq. (9-26) reproduces (9-12). 


The same formalism enables us to study scattering problems. For simplicity 
let us restrict ourselves to a one-dimensional short-range potential. Accordingly, 
the evolution reduces at large times to the free one dictated by Ho, and the 
S matrix is obtained as the limit 


oe lim eit Ho eit, —t)H ao it,H, (9-28) 
tro 
te ae O) 


Its matrix elements between states y, and ; are given by 
Ch y| S |i = lim | ea, dqi WF (qs, ty) <ay, ty | gis ti Wilgi, ti) (@=29) 
The notations are such that 
dp .~ 
W(q) = | e'P4 Wi(p) 
i att 


and w(q, t) is defined as the solution of the free Schrodinger equation such that 
wW(q, 0) = W(q), that is, 


w(q, = | Tam ° ep) (9-30) 


As |t| > 00, w(q, t) may be estimated by the stationary phase method as 


1/2 9 
W(q, t)~ ~(% ) ge sign (t)+imq?/2t W (™2) (9-31) 
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This may be introduced in Eq. (9-29) where we change variables according to 
dr = prty/m, qi = piti/m, with the result 


thie 1/2 
<W,|S|v> = lim | ao, dp; (ete) 


x WE(py) em P72 ee ty ty| = ist) eipit2m (pi) (9232) 


which may be checked in the trivial case V = 0, S = 1. When the path integral 
is used to express the matrix element of the time evolution operator, Eq. (9-32) 
admits an interesting interpretation. Indeed, it follows from this equation that the 
integral involves paths behaving asymptotically as free trajectories: 


iF 
t > co q(t) > P£ 
m 


tae q(t) > me 


In one dimension conservation of energy implies, of course, that pp = +p; but 
the formalism is easily generalized to more interesting many-dimensional cases. 


\ 


As an application let us present the eikonal approximation for three-dimensional scattering on a 
finite range potential V(r) at large energy and small transfer. 
We start from the representation of the evolution operator 


ro, tz | Ras, fe = | ote (t)] exp | ; dt [tmi? — V[r(0)] i} (9-33) 
In the free case (V = 0) this reduces to 


—ix/2 3/2 
| gim(ta—3)?/2(t2 — £1) 
2n(t2 se t;) 


Under the physical conditions of interest, the path integral is dominated by the classical extremal 
trajectories, which for large impact parameters can be approximated by straight lines: 


t—t 
r(t)=r, + = (nl (9-34) 
By, A 


After factorization of energy conservation and a Fourier transformation to momentum space we 
readily derive the following expression for the matrix elements of the transition operator 


=(S — iy 


e- Sitin+$)= = Naa ew foxe] |" ary(6+%)]- 1} 


p* > co ape) (9-35) 


where b (|b| is the impact parameter) and q are two-dimensional vectors in the plane perpendicular 
to the average momentum. 


This approximation has been generalized to the relativistic case. Consider the scattering 
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Figure 9-2 Crossed ladder diagrams. 


amplitude corresponding to the set of crossed ladder diagrams, with the exchange of scalar bosons 
of mass » and coupling constant g (Fig. 9-2). It can be represented by a path integral analogous 
to (9-32) and (9-33). The resulting amplitude for large s (square of the center of mass energy) and 
small transfer t = —q? is given by 


d*k eik-b 
T(s, t) = —2s | db ea"? [exp (e |= ela. =] _ | (9-36) 


The reader may determine the behavior of (9-36) as a function of s > oo and generalize further to 
electrodynamics where the corresponding amplitudes exhibit bound-state poles with the correct non- 
relativistic limit [see Sec. 2-3-2]. 


9-1-2 Trajectories in the Bargmann-Fock Space 


Up to now we have encountered integrals over trajectories in configuration 
space, Eq. (9-12), or phase space, Eq. (9-26). In both cases the boundary conditions 
involved the q or p at each limit. We want to introduce a new type of trajectories 
suited to a generalization to field theory. It is strongly inspired by the harmonic 
oscillator case since a field theory may be considered as an assembly of inter- 
acting oscillators. This will provide an easy parallel with the treatment of 
fermionic systems. 

We make use of the coherent states of Bargmann and Fock discussed in 
Chap. 3. This gives a representation of destruction and creation operators 

pe a ees eee (9-37) 
wp jp 

in a space of analytic functions of a complex variable denoted with a complex 
conjugation bar & or Z. The reason for this choice will appear in the sequel. 
If the original oscillator problem involved a frequency w we could first Renont 
a canonical transformation Q > Qw~'/*, P > Pw!/? before introducing a and a? 
according to (9-37). The analytic ree under consideration generate a Hilbert 
space with scalar product 


iA 
g| >= | <= oF g(2) (2) (9-38) 
2in 
and we have the correspondence 
a> se at—32z (9-39) 


OZ 
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leading to a pair of adjoint operators. An orthonormal basis in this space is 


(9-40) 


Gu 

A unitary transformation maps this description on the more conventional set 
of square integrable functions of the configuration variable qg. To the f, corre- 
spond the well-known Hermite wave functions of the oscillator. Of course, the 
f, are eigenfunctions of the operator H = ala = 20/02 with eigenvalue n. 

In the conventional approach we characterize an operator A by its matrix 
elements <q'| A|q> in such a way that the action of A on a state vector w yields 
the wave function 


[Aw](q) = {ea <q| Alq>W(q) 


Let us proceed similarly in the Bargmann-Fock space. If {n> stands for the 
state corresponding to the function f, in Eq. (9-40) we write 


A=) |\nAsnvn) Arn = “iy (9-41) 
Accordingly, for any state f, 


@|Af>= LaN@= Y va Anmmn| f> 


dt dt ae : 
->| ae a Aum ag © (9-42) 


The kernel naturally associated to A, and denoted A(Z, €), such that 


PAlo= \<4 4 de 8 Az OF 


is therefore 


A i= i a (9-43) 
me 

For sufficiently regular A, the function A(z, €) is an analytic function of the 

two complex variables ¢ and z. Incidentally, it is the desire to write the operators 

in this form which justified the choice of the complex conjugation bar over the 

argument of the analytic functions. 


The representation (9-43) enjoys the following fundamental superposition 
property as a consequence of the orthogonality of the f,: 


dy 
Axle, 8) = ie Sap OM AE AGG, (9-44) 


Returning to Eq. (9-41) we rewrite it as 
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gn a” 
A=) —— |0>A,,,<0| —— 
x fn! Ae </m! 
We recall that the projector on the ground state |0><O| may be expressed in 


: terms ofa normal product as 


OO: eae (9-45) 
Consequently, 


tn 
A= Y Ania e~% ag: = TAN, 2 (9-46) 


nm ~ ./n! m! 
This normal form suggests the definition of a normal kernel to represent the 
operator A. We denote it 4*(2, z) to distinguish it from our previous definition 
with z and z considered as independent variables 


Ze zm 
AN(Z,2)= ) —= AN, —= A=:Aal,a): (9-47) 
nm ./n! /m! 


To obtain the relation between A(z, z) and A(z, z) we may either use Eq. (9-46) 
or observe that the Hilbert space of entire functions is endowed with a re- 
producing kernel analogous to the Dirac delta function in the form 


=n a 


fO=Y Fal =D ie ek EE 1G 


, - (9-48) 
= |) Seto pears Wy 
For the operator ata”, 
tn 7) — 5n a” = dé dé —E+zé Seid” ir) 
Lalaf ](z) = 2 a sia = (44 2 i=l) 
(2 = 
Tl 
we find - 
A(Z,.¢)'= e** A(z, €) (9-49) 


and this relation extends by linearity to an arbitrary operator. 

Starting from Eqs. (9-44) and (9-49) we may compute the kernel corresponding 
to the evolution operator for a quantum mechanical problem. We assume the 
hamiltonian to be given in normal form in terms of the operators a‘ and a. Its 
normal kernel is simply obtained by substituting complex numbers for the 
creation and annihilation operators. We denote this function by A(z, €). For an 
infinitesimal time interval At we have approximately 


Cie Aa ieieettK*) (9-50) 


This reduces correctly to the kernel of the identity when At—0. Repeated 
application of Eq. (9-44) over a finite interval leads to the path integral 
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U(Z,, ty; Zi, ti) = lim 


n-> 00 


TI dz, dz, 
1 2ni 
nad tpt" 


x exp Een — >} ape Zizq i ~ 
i 


-1 
‘a A(Zu+15 a (9851) 
0 


We use for the limit the symbolic notation 


yap + 2,2; a7 
U(Zy, ty; Zi, ti) = | a z) exp Peers + |" dt |? a =e a} Ce) 


In this expression the integration variables z(t,) and z(t;) remain independent 
from 2(t,) and z(t;), which are fixed by the boundary conditions. 

We recognize once again the classical action in the exponent of (9-52). Indeed 
the form p dq or, rather, 4(p dq — q dp) may be written according to (9-37) as 


1 non 
Mp dq — q dp) = 5 (z dz — 2 dz) (9-53) 


which requires to treat z and z as independent variables. Clearly, all that is 
said above immediately generalizes to time-dependent hamiltonians and to several] 
degrees of freedom. 


ro 
Let us compute the evolution operator for an harmonic oscillator driven by an external time- 
dependent force in such a way that 


H = wata — f(t)at — fit)a (9-54) 


Here f is the complex conjugate of f As illustrated by Eqs. (9-19) and (9-21) the result should 
be proportional to e'/-? where I(f,i) is the action computed on the extremal trajectory satisfying 
the classical equations of motion 


z+ ilwz —f(t)] =0 FA (i) \ v4, 
z—ilwz—f()]=0 2t)=z, 
The solution is 


z(t) = z,e7 (+ 5 | 


t 


t 
dt’ e~ iaxt —t') Fite) 


P ot ; eres 
AN SES Tuas i a Salmaaae (2 


t 


These quantities are obviously not conjugate due to the dissymmetry of boundary conditions. The 
corresponding exponent of the path integral along this trajectory is 


LO : 
a 4(Z zy + 2;2;) + 3 | "at [zz — Zz — 2ih(z, z, t)| 
1 
t 
= Zs e7 (ltt) z+ i| ae (2, eit, -1) f(t) + f(t) e7 imtr—1) zi] (9-55a) 
LF 


al | "ae de Fig eFC) 0 = £) 


t 
t 
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Explicit calculation of the path integral gives simply 


U(zs, ty; 2, ti) =e" (9-55b) 


Thus the case of the driven oscillator is very simple in this formalism and the kernel is regular 
everywhere. 


The gaussian integration formula used repeatedly in these evaluations is 
worth quoting here as it occurs as a cornerstone of the application of path 
integrals. If A stands for the matrix of a nonsingular quadratic form the hermitian 
part of which is positive and = and u stand for column vectors of complex 
numbers, 


yi dz, dz, Ss = =A-1 
eae e-? ztuztuz _ (det AS el! u (9-56) 
1 


Note that the exponent on the right-hand side is the saddle-point value of the 
exponent of the integrand. 


9-1-3 Fermion Systems 


Since path integrals exhibit the close relationship between classical and quantum 
mechanics it would seem a priori that we would encounter some difficulties when 
extending the treatment to fermions. Fortunately the relevant construction in 
terms of an anticommuting algebra has been devised by Berezin and we have 
already made some use of it in Chap. 4. 

Let us start from a two-level system with the two operators a and a’ fulfilling 


faas=1 a’?=a"=0 (9-57) 
We shall try to represent them as acting on a Hilbert space of “analytic functions.” 
The analogy with the infinite series in z and z used previously for bosons 


suggests the following. Let us consider series with complex coefficients in two 
anticommuting variables y and 7, that is, such that 


m+in=0 =H =0 (9-58) 
These series reduce to polynomials of the form 
P(n,n) = Po + Pin + Pin + Pi2nh (9-59) 


The set of these polynomials of dimension 27 = 4 may be identified with the 
exterior algebra on a two-dimensional vector space (generated by homogeneous 
polynomials of degree one). The associative multiplication is defined in agreement 
with the rules (9-58). We also introduce the linear derivation 


OP = py + pi2n @P = pi — pian (9-60) 


On each monomial they act by suppressing the corresponding n (for 0) or 4 
(for @) once the latter has been brought to the left; otherwise they give zero. 


440 QUANTUM FIELD THEORY 


Define the subset of analytic functions by the condition 
of =0 (9-61) 
which implies that f depends only on 7. 


Note that 
a@Pp=2P=0 and d0P= —00P =pi2 


which means that polynomials in the derivative operators have the same structure as the original 
anticommuting algebra. The reader will observe that 0(P,P2) is not equal to 6P;P, + P,P, and 
will easily find the correct version of this identity. The construction may easily be generalized to 
several degrees of freedom. With 2n degrees of freedom, the exterior algebra will be of dimension 
2?" and the space of analytic functions of dimension 2”. 


Returning to the case n = 1 we write an analytic function as 


f=fot+hfin 
and define a scalar product such that 
(9.f)=Gofot+ Hh (9-62) 


Here a bar on scalars means complex conjugation. Is it possible to represent this 
scalar product as an integral as in the boson case? The answer is “yes,” provided 
we identify derivation and integration as follows. The integral symbol is defined 
by linearity starting from the requirements 


[arn=1 anni [a1 =[an1=0 (9-63) 


If we also agree that dy and dy anticommute and that the rules (9-63) apply 
when dy and y or dy and yn are brought next to each other, we indeed see that 
integrals and derivatives are identical. Consequently, 


| a P=op | arp = op | eiay p= op (9-64) 


As a consequence the integral of a derivative vanishes (67 = 67 = 0) and the 
procedure is easily extended to several degrees of freedom. 

We have the possibility to make changes of integration variables under the 
integral sign. If we limit ourselves for the time being to linear transformations 
which automatically respect the structure (9-58), ie., of the form 


where A is a nonsingular c-number matrix, a substitution in any polynomial P 
yields 


P(n, 1) = Q(E, &) 
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and in particular 


Pion = q12CE = (pi2 det AEE 
As a result 


[an dn P(n, 0) = | aE acta A)" *Q(é, 2) (9-65) 


This implies a rule differing from the usual one in the sense that the standard 
jacobian appears inverted since 
det A = J (4) 
oS 


A basis being chosen to allow for a definition of analytic functions, let us 
define complex conjugation as 


fin) = fo + fin (9-66) 
We then verify that 


g.f) = {ai dn e~™ g(n) (7) (9-67) 
analogous to formula (9-38) for bosons. We have now the following representation 
of a and a’ in terms of a pair of adjoint operators: 

a>d at (9-68) 
Obviously a? = a’? = 0. Furthermore, 

a(a'f)=fo a'(af)= fan 

As a result 
. aa’ + ata=1 
and (9,af)=Gohi  (ha'g) = figo = (9, af) 


More generally, we can assign an integral kernel to a linear operator on the space 
H# of analytic functions. Consider the orthonormal states |O> and |1> such 
that a|0> = 0, a*|0> = |1, corresponding to the functions | and 7. Let us write 


A= > |n>Anm<ml 
(Af)(n) = | eeae e * An, OF) (9-69) 
A(n, ¢) = ya Hq Ages 


As in the case of bosons we represent the projector on the ground state as 
1) Giaae 2) —1-—a'a (9-70) 
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and rewrite A in normal form: 
A=) Agutert al ty Alga a (9-71) 


The associated normal kernel 
AG — >) Aa to (9-72) 


is related to the previous one through 
A(i, n) = e™ AN (i, n) (9-73) 


For instance, the integral kernel of the identity is e”” so that 
f(f) = | aeac aa (9-74) 
while the product of operators is given by 


A, A2(f, 0) = ae dé e~ ® A, (9, 2) A2(E n) (9-75) 


The analog of formula (9-56) for gaussian integrals is 


~ 


| I dy, dn; exp |-5 Me Anim + D (eb oH Zon) =detA exp b ZA 


(9-76) 


The close parallelism with the boson case allows an immediate transcription to 
obtain a path integral for transition amplitudes. Let H(a‘,a,t) be the normal 
ordered hamiltonian of a fermionic system. The corresponding normal kernel is 
obtained by substituting 7, 7 for a’, a in this order. Consequently, the kernel of 
the evolution operator is given by 


2 ngne thine. ( , [at " 
Ulli, ty mt) = | A, n) exp jens eee | dt [om h(i. af} 
ti; 
(9-77) 


As an exercise consider the motion of a quantum spin 4 submitted to the action of a constant 
field B along the z axis. The ground state is defined as corresponding to the value S, = 4 of the spin. 


The hamiltonian reads H = pB(2a‘a — 1) with p the gyromagnetic ratio. Equation (9-77) shows 
that 


U(iis, tes ni ti) = exp [in Blty — t) + pny en 2M) (9-78) 


Note the similarity with the harmonic oscillator. The treatment can be extended to a time-dependent 
field, including in particular a transverse field rotating at frequency w. 
What is striking when looking at Eq. (9-76) is that the integral of a quadratic form is also 


given, up to a factor, by the value on the “extremal trajectory,” ie., giving a stationary value to 
its exponent. 
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9-2 RELATIVISTIC FORMULATION 


We generalize the previous approach to the infinite systems of interacting fields. 


9-2-1 S Matrix and Green’s Functions in Terms of Path Integrals 


We start with an examination of the familiar case of a neutral scalar field 
coupled to an external real c-number source j(x). The classical action is 


m? 
I0(Q, j) = [a 1000" rigeee io| (9-79) 
and the quantum hamiltonian 
m2 
H= jax Ee *t 3(VQop)* at Gy Pep = jo. | (9-80) 


It describes an assembly of quantum oscillators coupled to varying external 
forces (Chaps. 3 and 4). At a given time the Fourier decomposition of the 
field is 


Pop(X) = {a [a(k) eik** + at(k) e~*'*] 
(9-81) 
Top(X) = —i | a co(k) [a(k) e*** — al(k) e~*°*] 


in terms of which 


H= [ak [oo(k)a' (Kalk) — f(t, K)at(k) — f(t, k)atk)] 
(9-82) 


eK) = [ax e~ kx i(x, t) 


In this form the hamiltonian is diagonal and we can apply formula (9-55) which 
gives the integral kernel of the evolution operator 


Uz pts Zi t;) = exp (| a ao eW tthe, —t) 7:(k) 


+i \ * dt[Z,(k) es F(t, k) + F(t, k) cP"? 2k) 


i 


tie ii dt dt’ F(t. k) emote" F(t, wi) (9-83) 


i 


With the source switched off at |t|— oo, the S matrix is defined as the limit 
of the operator eo U(ty, ti) e-isHo, where Ho is obtained from H by setting 
j = 0. For coherent states, such as those used here, the action of e~ “”° amounts 
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simply to the shift z>ze ‘’', where w is the frequency associated to the 
oscillator described by z. Consequently, the integral kernel of the S matrix reads 


S(zs,z) = lim exp {a Boze 


= tito 


1 \" dt [Z,(k) e'™* F(t, k) + f(t, k) e7'"z,(k)] 


5 : [" i dna Fit, k) eo iotky let" Fu, wv} (9-84) 


From Eq. (9-49) the normal kernel will simply follow if we drop the first factor 
in the exponential. The remaining part is interpreted in terms of the classical 
asymptotic field 


,,(X, t) = {ai [zi(k) e~ *°* + Z,(k) e***] (9-85) 
solution of the homogeneous Klein-Gordon equation. Since 2,(k) is not the 
complex conjugate of z;(k), @,, is given in terms of boundary conditions on 


positive frequencies for t— —oo and negative ones for r> +x. We recognize 
Feynman’s mixed boundary condition. With these notations 


{at | de [Ep(k) f(t, k) + Fit, kb) e-" 2(K)] 
= [as {ak j(x) [Z(k) eiw(kyt—ik-x 4 z;(k) e-iw(ke+ik-x] 


= [atx I(X) @as(X) 
Furthermore, 


| a {la dt’ f(t, k) e~@Mlt—-t! F(t’, k) 


= [Jes d*x! (0) (x) {ak e- iv(ly|t— t+ k-x—x) 
The integral over k is the Feynman propagator 
4 hg (5 I 
[at e-iali)|t—1|+ik-(x—x) — j avk e ®&*) 
(2n)* k? — m? + ie 
= —iGr(x — x’) 
<0| T@op(X) Pop(X’) |O> 
where @p, is the quantized scalar free field. Finally, 


I 
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F(Z p92;)| pvexp [ [as j()@,(x) + : [fers Cl ia (oh Gahec— 0) 


(9-86) 
"This was indeed our result (4-63) and was used as a starting point in the discussion 
of Wick’s theorem. The relativistic covariance as well as Feynman’s ig prescription 
for propagators have followed naturally from the path integral formalism. 


To obtain the S operator which will be denoted So(j), we substitute Pop to 
~,, and normal order: 


So(j) = : exp [ | ax i) | Zo()) 


(9-87) 
Zo(j) = exp E ie d*x’ j(x)Gr(x — x00 | 
Since 1 3 
(C + m’) TC) Zo(J) = j(x)Zol/) 
formula (9-87) may be reinterpreted as 
\. a mes) ee E 
So(j) =: exp at x oo C rn) ral Z (J) (9-88) 


where (L] + m?)(6/6j) acts only on Zo(j). . . 

We consider now more complex interactions. Let us introduce, for instance, 
self-coupling through a potential V(@) in such a way that the action takes the 
form 


2 
[= | d*x Ey -5e- vio)| I(g, j)=1+ [ax jp (9-89) 


As indicated by the notation, we take for simplicity V(g) to depend only on 
~ and not on its gradient. For instance, V(~) may be j*/4!. To derive the 
S matrix we use the same type of reasoning leading to Eqs. (9-23) and (9-24). 
In other words, we have the following relation between normal kernels: 


SN = exp |-: fas v(; fp) [Plo (9-90) 


with S%(j) given by the expression (9-86). . 
The perturbative series follows from the expansion of the exponential operator 
in (9-90). The S matrix itself may be written 


S =:exp | fas Bop(x)(Cl + m?) mat Z(D|j=0 


1 6 


(9-91) 


446 QUANTUM FIELD THEORY 


We recognize in Z(j) the generating functional of Green functions and the 
above formula is identical with Eq. (5-38). What is new here is that we have 
derived functional representations for the normal kernel of the S matrix and for 
Z(j). Indeed, the operator exp [ —i J d*x V(6/idj(x))] generates the self-interaction 
leading to 


~ PA C ra k a k ; ‘ 
F (Ep, 2) = lim | 2) exp || a ce mi) ; ak acl il(S, 0! (9-92) 


with 


: ~ | 2(t, k)z — F(t, lae( hk : . 
Uf i= \ dt \a (tte) 2( CNOA) ok) a(t, k)z(t, k) — hyelZ, 2) | 
1, : 


2i 


(9-93) 
In the present case 


hing = fax V[ ox] (9-94) 


where q is a functional of z(t,k) and z(t, k) obtained by replacing in (9-81) the 
operators a(t, k) and a‘(t, k) by these c numbers: 


P(x, t) = | aetee, k) elk + 2(t, k) ew] 
(9-95) 
Lie | dk co(k)[2(t, k) e&* — 2(t, k) eo“ ®*] 


Before this substitution H;,, has to be written in normal form. Finally, to 
generate the S matrix from the evolution operator the limits ty, —t;-+ oo have 
to be supplemented by the free asymptotic conditions 

lim Z(t, k) = Z,(k) eb 

[d=us(0.0) 


lim z(t, k) = 2,(k) e7 1" 


bias = (0.0) 


(9-96) 


The normalization of the path integral follows from the previous case when 
V is identified with jg. For shortness the measure will be abbreviated as 
Yo, n) or even Dg). 

Similarly, the functionals Zo(j) and Z(j) are expressed as integrals 


Zoi) = | 20) exp E o(9) +i [a ooaits (9-97) 
PAW) | 20 exp E (g) +i [a oie (9-98) 


Io(~) = [ats Ey ~ = 9 | 


FUNCTIONAL METHODS 447 


> m . 
I(g) = [as Ey a5 g(x) — vio) 
It is understood that the measure involves a normalization factor such that 
: Z2(O)e= 1. 
From the expression for Zo(/) we obtain the integral 
: ih) 
| 20001) 6x20 = ful » Crepe) GrlxXp, . — xp, ) 


Mn! 
2°n! permutations 


that is, the explicit form of Wick’s theorem. 


It is, of course, legitimate to consider the path integral (9-98) to be defined as a formal power series 
using Eq. (9-91) or (9-99). This allows us to check that a number of manipulations familiar when 
dealing with integrals—such as changing integration variables, integrations by parts, etc.—are 
justified. As an example let us investigate the effect of an infinitesimal change of variable of the 
form 


(x) = x(x) + eF (x, x) (9-100) 
where F is an arbitrary functional of y subjected to the restriction that it admits an expansion 


in powers of y. This change is canonical in the sense that the relation @ + y may be inverted (as a 
formal series). Taking into account the jacobian arising in the change of variable we obtain, to first 


order in s, 
— sue) ila E I(x) kr 
Z0— oe + ac x cea i + ié [a x Bex) ag ix) FQ Xx) 


x exp i100 + | e<icoxe9| (9-101) 


Collecting the terms proportional to ¢ and extracting the factors in front of the integral by replacing 
x by 5/idj(x), we find 


sr fl 6 16 OF [1 6 
4 Z 9-102 
[a tsa G3 5) + 1 |r CF on +565 vx) Js 


This may also be written 
16 oI {16 
4x Fl —— —~—)+ Z (9-103) 
| : =F elie ( 4 i | ae 


and can be checked directly from Eq. (9-98). 
Let us specialize this general identity to the case where F(x, x) = f(x), that is, the change of 
variable is simply a local translation on the field. Equation (9-103) reduces to 


él Pe) 
9-104a) 
is a + 0) fzn=0 a 


In the scalar field case this reads explicitly 


2) (Gas)! | er) 9-104b 
(+m) oY i 500 i(x) |Z) ( ) 
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(0, +m?) 


Figure 9-3 Graphical representation of the identity (9-104a, b). 


s 


This identity —a direct consequence of the equations of motion—relates functions with n+ 1, n, 
and n— 1 points. The first involves a Klein-Gordon operator and the second an insertion of V’. 
It is depicted on Fig. 9-3. 


Exercises 

(a) Derive the corresponding identity satisfied by G.(j), the generating functional of connected Green 
functions. 

(b) Observe that if the functional F(y,x) is local, ie, only involves y at the point x, Eq. (9-102) 

introduces 6F/5y(x) = F’[z(x)]6(0). How can this term be interpreted? Show that this term is of 

higher order in fh as compared to the other ones in (9-102) and that it is compensated by the 

Wick contraction between F(x) and the piece (0 + m?)y of 51/5y. 

Use the equation of motion to show that the insertion of the operator f d*x y(x)[51/50(~)] 

counts the number of external lines of Green functions. 

Prove the equivalence theorem. The latter states that while an infinitesimal invertible change 

of field variable modifies the Green functions, it does not affect the S matrix. In other words, 

if F(~) = O(@?) the S matrix obtained by applying Eq. (9-91) to Z’(j) given by 


— 


(c 


— 
QQ 
— 


Z'(/) = {2 exp 1 (9) + ic JO)[ EC) + F(@, ~} (9-105) 


coincides with the one computed with the help of Z(j). Indeed, when amputating external lines 
as indicated by the operator (J + m?) in (9-91) the effect of F(@~) only amounts to a wave-function 
renormalization. 

(e) Show that even in the case where the interaction lagrangian contains derivative terms the 
perturbation expansion of the path integral leads to covariant Feynman rules. 

(f) Finally, extend the formalism to Fermi fields. Write the generating functional Z(j) for the Yukawa 
coupling of fermions to scalar bosons and for the case of electrodynamics. In this last case 
show that a change of variables of the form of an infinitesimal gauge transformation leads to 
the system of Ward identities discussed in Chap. 8. 


These functional techniques to derive identities between Green functions will be used 
extensively in the following chapters. 
9-2-2 Effective Action and Steepest-Descent Method 


We have already defined the generating functional of connected Green functions 
G,(j) according to 


Z(i) = exp [G.(j)] (9-106) 


FUNCTIONAL METHODS 449 


and the effective action through the —- transformation 


(x, j) = Gj) 


idj(x) a 
(9-107) 


iT(~) = G.(j) — i [ar JOD e(x) 


To evaluate Z(j) from the path integral we may retain in the exponent the 
quadratic part of the action, expand the rest in a series, and apply Wick’s 
theorem (9-99). The underlying assumption is that the coupling constant is small. 
A slightly more general reasoning is suggested by the path integral representation 
itself. Since the only integrals that can be performed in ctosed form are the 
gaussian ones, the idea is to use the steepest descent or stationary phase method 
(in the minkowskian case) to select the best point around which to expand in 
series. The small parameter is here Planck’s constant h—as is clear if the 
dimensions are explicited and the action I replaced by I/h. Therefore, the above 
method corresponds naturally to the semiclassical approximation in quantum 
mechanics and leads to a series organized according to the number of loops. 

Our first goal is therefore to find the extremal values of the exponent in 
the integral (9-98), that is, the fields @o satisfying the classical equation 


(CO + m?)@o0 + V'(@o) =i (9-108) 


We shall assume that the solution reduces to the trivial one @) = 0 for j=0 
so that, in the sense of formal series at least, the solution is then unique. The 
Green function to be used in the solution of (9-108) involves Feynman’s ie 
prescription. 


The assumption that only the trivial solution remains in the absence of source turns out to be 
invalid in a number of physically interesting cases. Such nontrivial extrema of the classical action 
require a case-by-case analysis for a proper interpretation. 


& 


In the vicinity of this extremal trajectory we shift the integration variable 
as > > +, keep the quadratic part in the field in the exponential, and 
expand perturbatively the higher-order terms. By virtue of the stationarity con- 
dition (9-108) linear terms are absent, and as usual the normalization requires 
Z(0) = 1. Consequently, we find 


2 
Zi ae | 210 ) exp (; [as joo = - [m* + V"(@o)] 


= « Vo ) (9-109) 


p23 


The new quadratic part is 


2 
[a {Hoo = — [i v'(oa) = | ats so[O +m? + V"(@o)le (9-110) 
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and leads to a nontrivial propagator through its dependence on @o. To obtain 
the h expansion let us rescale the field as gp > h'/*@ so that 


Z(j) = eG/h 


2 
ore |e (~) exp (: ars {B00 ~ . [m? + V"(po)] 


— p23 vo (9-111) 


p23 


Wick’s theorem applied to (9-111) yields nonvanishing contributions only for even 
polynomials in gy. Only integer powers of fh will therefore occur in the loop 
expansion. From Eq. (9-111) we read that the leading term (order h°) to G,(j) 
is I(@o, j). Let us compute the next term. The integral over the quadratic part 
yields 


[20 exp \- [ats g[0 +m? + Valo} = (Da Kp Kee? 


Ky = {OD ,+m?4+V"[oo(x)}}64(x—y) (9-112) 
Ko = (O, + m’)d*(x — y) 
As in Chap. 4, we use capital letters for determinants and traces of operators 


of infinite dimension. The inverse of Ko, introduced by the normalization, has 
to be chosen as Gr(x — y). Hence 


Ko ‘Ky = 5%(x — y) + Ge(x — y)V" [poly] 
Since a determinant may be written 
Det A=-erna4) (9-113) 
we find 
l 


h Trin [1 + GrV"(@o)] + O(h?) (9-114) 


To obtain I’(@) we have to invert the relation 

dG(j) 

idj(x) 

According to (9-114) and (9-108) @ is given to leading order by @o up to 
corrections of order h. Moreover, since I is stationary at @ 9 we have 


I(~, j) — I(@o, j) = O(h’). Finally, we have to subtract J d*x ~(x)j(x) from ~iG,(j) 
to obtain I, given therefore by 


p(x, j) = 


rare oye ; hTr In [1 + GpV"(9)] + Oh) (9-115) 
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Figure 9-4 The effective potential to the one-loop order. 


The perturbative interpretation of the second term is clear if we expand it as 


ih oe (1 a-1 
= Trin[1+G,V"(@)] = ih 2 : - Tr {[GrV"(9)]"} 
oo — if wn) A 
= ih z _ [dtes-ata Gr(z1 — 22) V"[p(22)] Gr(z2 — 23)°** 


-- V"[@(2n)] Gr(2n — 21) V"[(z1)] 


This is the sum of the contributions of one-loop diagrams made of n propagators —iG»(x — y) 
and n vertices —iV”(g). It is depicted on Fig. 9-4 in the case of V(g) = Ag*/4!. Notice that 
the factor 1/2n in front of each term of the sum is the symmetry factor of the corresponding 
diagram (n stands for the rotations, 2 for the reflection); similarly in the case of g* theory, the 
factor 5 in V"(@) = A@2/2 takes into account the symmetry between the two external legs attached 
to each vertex. 

This expansion may be carried out to all orders. The successive terms of G, are represented 
by connected Feynman diagrams generated by the interaction term —) ,>3 h?!?~*(p?/p!)V(@o) 
with propagators obtained by the inversion of the kernel 1) + m? + V"(@o). 

As far as I'(@) is concerned, the result of the Legendre transformation is to select among 
the previous diagrams only the one-particle irreducible ones, and to replace @o everywhere by the 
arbitrary argument ¢. Of course, any actual calculation has to face ultraviolet subtractions. 


In short, the steepest-descent or stationary phase method leads elegantly to 
the semiclassical expansion according to the number of loops. To go beyond 
perturbation theory requires either to expand around nontrivial extrema or to 
approximate the path integral in some utterly different way. 

We return to ['(g). We know that its expansion in @ generates the one- 
particle irreducible Green functions. As far as particle physics is concerned it 
is generally this aspect which is relevant. We may also insist on the role of 
I'(@) as an effective action. Taking into account translation invariance we can 
find an expansion involving higher and higher derivatives in the field ¢@ in the 
form 


T'(g) = [as L—Ven(p) + Ze(9)(89)? + °°7] (9-116) 
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In (9-116) the first term involves the sum of all proper functions at Zero 
external momentum, the second sums all second derivatives at the same point, 
and so on. In principle the function g(x) remains arbitrary. However, if we 
wish to compute V.; only, we may satisfy ourselves with a calculation for a 
constant ~ provided we can unambiguously factorize the four-volume divergent 
integration over x. 

As an example let us extract V., up to order f from Eqs. (9-114) and (9-115). 
In general, 


Vag = VO + RY + VQ) + 9-117) 


Using (9-114) we have first 


a 
Vo) = = 9 + Vie) (9-118) 


In the determinant occurring in (9-112) ~ is now a constant and the propagator 
[Ol + m? + V"(¢)]~* is thus diagonal in momentum space: 


Tr In {[0 +m? + V"(g)|(0 + m’)~*} 


4 1 
= ar Gt 5 — V"(9) ed (9-119) 
Tl p- 


ihe ie 


—i{ d* 1 
pees Ga os (0) 
Men y) | 3 E wg) p? —m? + ;| 


This expression is, of course, meaningless before ultraviolet subtractions. To be 
concrete let us pick the potential 


Ag* A 
Vie=F- V9) = 


To comply with the prescription of normal ordering we should also have added a 
term in ~*. Otherwise we have to include tadpole diagrams, corresponding to a 
contraction of two fields at the same vertex. 

This is precisely the only divergent one-loop diagram of the two-point function 
1) (Fig. 9-5a), and it contributes a quadratically divergent term proportional 
to g” in the expansion of VD: 


Ag? | d*p i 
A in)" p* ~ m2 ae 


To this order we also have the logarithmic divergence of the four-point function 
I associated with the diagram of Fig. 9-5b: 


i (wh (ap 
4\ 2 (27)* (p? — m? + ie)? 
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(2) (b) 
Figure 9-5 Divergent diagrams to the one-loop order. 


Higher powers of @ lead to convergent integrals. Renormalization introduces 
counterterms needed to insure that 


TT (m2nys) = 0 aos a iit 9-120 
phys) = dp? (Mphys) = (9-120) 


and a constraint on the four-point function giving the meaning of the renormalized 
coupling constant, for instance, its value at the on-shell symmetric point S: 
pi = Maye 
(pi+ pj)? =3mens iFj (9-121) 
I (pi) |s = —Apnys 


These conditions are useful when dealing with some actual application to a 
scattering problem. They are, however, cumbersome to compute quantities in the 
effective action expanded at zero external momenta. 

Up to a finite renormalization, we can replace (9-120) and (9-121) by 


(2) 
120) = —m? r=! = (9-122) 


To emphasize the distinction we have introduced the label “physical” to the mass 
and coupling occurring in the previous normalizations. 

The virtue of Eqs. (9-122) is that they can be translated as conditions on 
the effective renormalized potential and function Z,,(@) as 


d? 5 
Hae Veg(O) = m 
4 
dot Vag(O) = A (9-123) 
Zeq(0) = 1 
These requirements are obviously fulfilled to order h° as shown by (9-118). At 
this level we also have 
Z) a (9-124) 


The counterterms in g? and ¢* cancel the corresponding terms in the higher 
orders of V,, and Z.. Consequently, the correct one-loop contribution to the 
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effective potential is 


pre Ao? /2 jp?/2 1 (Ag?2? | 
) P a 
Ven 2 (|< in (: pain + ie a p> —m + ie = (p? — m? + is)? 


(9-125) 


It is useful at this stage to perform the Wick rotation po > ipo. We denote by 
k the corresponding euclidean four-momentum so that 


1 dtk 1g?/2 Ag?/2 1 4g?/2 \? 
ie = ~ 9-126 
Ven 5 | aoe in (: ae rae k? + m? ( 
Performing the integral and adding this expression to the zeroth-order term 
given by (9-118) yields 


mo? Ag* 1 Ag? : Ag?\ Ag? 3 Ag? g 
Ver tae pga | (9 | eel ole ee 


(9-127) 


We observe that the large-g behavior of V(@) is modified by the quantum 
corrections. To exhibit it more transparently, it is useful to introduce yet another 
set of normalization conventions to be able to set m = 0. We may define a new 
coupling constant jy, such that 


(9-128) 


From (9-127) it follows that 


ae AM? m? 
—) 6 kn —— aieee 
Am + (8x? n me +16+ o()| 


We use this definition in (9-127) and consider the limiting massless theory for 


which 
4 A2 4 a4 a5 
Valo) = Iu 5+ Gea (Ine — 2) 4 (9-129) 


(167)? M* 6 


a result due to Coleman and Weinberg. It is not possible to directly set m = 0 
in Eq. (9-127). This arises from the structure of the ultraviolet subtractions in 
Eq. (9-126), the second of which, designed to enforce the condition (d*V,,/d*)(0) = 
0, introduces an infrared divergence in the limit m= 0. We have to choose an 
arbitrary but nonvanishing subtraction point @ = M to define the massless 
theory. Stated differently, the I functions at zero momentum generated by the 
expansion of (9-127) are singular as m goes to zero. The behavior when @ > co 
is clear from the fact es the dimension of V is four so that dominant terms 
must be proportional to g* up to logarithms in @/M. Note that the arbitrariness 
in the point M implies that a related change in M and Ay must leave Vor 

invariant. We encounter here a manifestation of the renormalization group to as 
discussed at length later. 
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To first order in the loop expansion there is no wave function renormalization in the y* model. 
Nevertheless, the function Z,(g) is nontrivial to this order even though it contains no logarithms. 
Using the effective action (9-115) it may be shown to be equal to 
32 2 
ap 3 aA cmon 
6(42)° (2m? + Ag?) 


Ze?) = 1 (9-130) 
The massless limit may be obtained as in (9-129). Calculations have been pursued to higher orders. 


Let us quote here the results up to second order with the corresponding diagrams. We use the 
shorter notations 


hp? hh , m* 
= (any? Fenl9) = ‘1 v(x,@) Zeg(p) = z(x, a) (9-131) 
and we obtain: 
Diagram v P 
: 1 
> a 0 
6 
= 2 ax. a aS 
( ) tfa+x In (1 + x) («+=)] eats 
a? 2 a2 2.8 in (1 + 5) a =| 
( x ) zy La + xin + x) — x] | Te a 
a? 1+x 5 2 . 
a ++ 2 In* (1 + x) — 2(1 + x) a n+) +S 
x 4A 
—-1 
«In(L +3) +2x| + [ima +0) + 3 | 
a [In (1 +x) + 4(A — 1)] 
~ (1 +x) 
4 x? 
=-——, +B 9-132) 
z 3 (1 + x)? ( 


Here B is a constant depending on the normalization conventions and A is given by 
: Inu Shee 1 1 | 
Bese a | = 79S (3-133) 
tei I ae Acar (2+ 3p? 


The effective action discussed here is similar to the Euler-Heisenberg lagrangian of electromagnetism 
(Sec. 4-3-4). We leave it to the reader to spell out the details of this relationship. 

The S matrix has an expansion in powers of f similar to the one of Green’s functions. Show 
that the two first terms for its normal kernel are given by 


S (Zs, zi) = exp , T(Pciass) = 4 tr In ft + Gr V" (@ctass) | ar oti} (9-134) 
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where is the solution of 


class 


(Cl + m)Gctass + V’(etass) = 0 (9-135) 


with Feynman boundary conditions; ~,),,, is also given by the integral equation 
Petass(X) = Pas(*) — {ey Gr(x — y) V'C@ctass(¥)] (9-136) 
with @,, given by Eq. (9-85). 


The effective action may be given a more physical definition. In particular, 
V.q(@) may be interpreted as the ground-state energy density under the constraint 
that the mean value of the field is equal to g uniformly. It enables us to 
explore possible instabilities of the system (Chap. 11). 


9-3 CONSTRAINED SYSTEMS 


A number of dynamical systems may be described by observables submitted 
to constraints at fixed time. As a familiar instance we may quote electrodynamics. 
The quantization of such systems is not a straightforward matter and we shall 
soon encounter even more severe difficulties in the case of nonabelian gauge 
theories. Path integrals provide an ideal framework to handle such systems because 
they maintain a close relationship with the classical case where a simple treatment 
may be given. The quantization method requires the elimination of as many pairs 
of canonically conjugate variables as there are constraints. The latter have to 
satisfy suitable compatibility conditions to be clarified below. 

This technical development can be skipped in a first reading and reconsidered 
when turning to the concrete application to gauge fields (Chap. 12), where use 
will be made of the result (9-159). 


9-3-1 General Discussion 


Let a classical system with n(n > 1) degrees of freedom be submitted first to a 
unique constraint 


f (p,q) = 90 (9-137) 


Call C the (2n — 1)-dimensional manifold in phase space characterized by (9-137). 
Our considerations will apply locally, 1.e., in a neighborhood of a point belonging 
to C. Furthermore, we should not distinguish between two functions f and F 
which both vanish on C, that is, such that F(p, q) = «(p, q)f(p, q). Let be the 
ring of (differentiable) functions which vanish on C. We use the notation F ~ 0 
to mean Fe o&. 

We may incorporate the constraint in the action using a time-dependent 


Lagrange multiplier A(t). Equations of motion follow from the stationarity con- 
ditions of 
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[= (a Lpd — h(p, g) — Alt) f(p, @)] (9-138) 
These include Eq. (9-137) obtained by varying A, together with 
. oh of oh of 
i= sa tA B= —-—-—-A— <i : 
ap, ap, p da: A on sin (9-139) 


Of course, on C the variables (p;, qi) are too numerous. A natural compatibility 
condition is that the evolution (9-139) leaves the manifold C invariant. This reads 
in terms of Poisson brackets 


th, f} ~0 (9-140) 


More generally it follows that any F ~ 0 will have a Poisson bracket with h 
belonging to «: 


F~0O={h, F} ~0 (9-141a) 
is stable under the Poisson bracket since it has a unique generator 
F~O=>{fF}~0 (9-141b) 


and Eqs. (9-14la,b) imply that this ring of functions (but not necessarily an 
individual member) is stable under the evolution (9-139). Let F be an arbitrary 
fixed element in ./. We define an equivalence relation E on C as follows. Consider 
the flow generated by F in phase space. In infinitesimal form it is described 
by the equations 


a (u) = in —(u) = — Are F~0 (9-142) 
Two points on C will be equivalent if and only if they belong to the same 
trajectory of the flow (9-142). This equivalence relation E is (1) independent of 
the choice of F in .¥ and (2) invariant under the time evolution. 

Indeed, we note that if a line of flow passes through a point belonging to 
C it is entirely contained in C. Furthermore, if F’ is another element in /, 
since F = aF’, the tangent vectors to the two flows are proportional on C, which 
proves the first point. From (9-141) h is constant on the lines of flow in C. 
Finally, let G be an arbitrary function constant along the lines of flow (9-142) 
lying in C, that is, such that 


F~0=+{G,F} ~0 (9-143) 


Its time evolution during an infinitesimal time interval ot is given by 
G>G+G6t with G={h+ 4G} ~ {h,G} and {{h,G}, F} = {h, {G, F}} - 
{ GaP vy ~ 0 from Jacobi’s identity and conditions (9-141a) and (9-143). There- 
fore G+ 6G is again constant along the lines of flow and this proves the 
second point. 

The manifold C is thus split in a time-independent manner in a set of 
equivalence classes according to E. The factor space C/E may in turn be considered 


458 QUANTUM FIELD THEORY 


as a 2n — 2 phase space provided some regularity conditions on the function f 
are fulfilled. All the flows of the type of Eq. (9-142) are equivalent on C and 
physical observables (such as the energy) are constant under these flows. It is 
thus sufficient to choose a representative in each class, again in a regular 
manner. To this end, we intersect the manifold C by another one described by 
the auxiliary condition 


g(p, q) = 9 (9-144) 


in such a way that each line of flow admits a unique intersection with this 
transverse surface. This is ensured if gy varies monotonically along each line of 
flow, which will follow if 


{f£g} #9 (9-145) 


With such a choice we can explicitly parametrize C’E by performing the canonical 
transformation 


(p,q) ~(P;Q) lsijsn 


(9-146) 
P, = g(P, 9) 
In the new variables the bracket between f and g will be 
_ Of 
- {hop 50, 
and the condition (9-145) will enable us to solve the equation 
of 
ay 0 —— 9-147 
F(Q, P) 60,” 0 (9-147) 
for QO, asa function Ch mame, On— 95 Pisses Po: 
ORONO 122 Ona Pt een (9-147a) 


The definition of the factor space C/E is completed by condition (9-144) which 
takes the form 


P, = 0 (9-148) 
Finally, the quantity 


FG rns Onto ioe Ps 1) = h(i, Pi) 0,=0,(Q;,....P,-;,0) (9-149) 
P,=0 


is the effective hamiltonian on the remaining space, as is readily checked. 

The arbitrariness involved in the choice of the function g required to fulfil 
Eq. (9-145) may create some difficulty in a global definition throughout phase 
space. 


It is easy to generalize this construction to the case of m <n independent 
constraints 


Sil, 4) = 9,..., falp, q) = 0 (9-150) 
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The ring ./ contains the smooth functions vanishing on the manifold C defined 
by the above equations, i.e., of the form 


m 


F(p, q) = ¥. u(p, DAP, @) (9-151) 


1 


and we require that {h, F} belongs to ./ if F does: 
F ~0= {h, F} ~0 (9-152) 


In order to be able to define on C an equivalence relation eliminating m other 
coordinates, we also require that ./ be invariant under the Poisson bracket 


F,,F, ~0=>{F,, F2} ~0 (9-153) 


a condition which was automatically fulfilled when ./ had a unique generator. 

Proceeding as before, we define a fibration on C by considering through 
any of its points the set of trajectories generated by any element F ~ 0. The 
tangents to these trajectories build up a m-dimensional linear manifold in the 
(2n — m)-dimensional tangent space to C. The time-independent equivalence 
relation E identifies points belonging to the same m-dimensional manifold 
generated by ./, and C E has a natural phase space structure. The latter may 
be explicited by the introduction of m auxiliary conditions 


gi(p, q) = 0,...,9m(p, q) = 0 (9-154) 


intersecting the m-dimensional fibers of C at a unique point. For this property 
to hold, a sufficient condition is that 


det {9 fi} # 0 loeb m (9-155) 
A canonical transformation (q, p) > (Q, P) is defined in such a way that 
Bet = Sige sl n= Im (9-156) 


Conditions (9-150) and (9-155) enable us to solve on C for Q,-m+1,-.-,Qn in terms 
of O4,...,On—m, P1,---,Pn, and C/E is finally characterized by 
[econo = 0 Ae ate (9-157) 


On the remaining space the dynamics is generated by a hamiltonian H obtained 
from the original h(p, q) by the canonical transformation taking into account the 
above restriction procedure. 


Show that this construction for m constraints follows recursively from the one given in the case of 
only one constraint. 


To quantize such systems in terms of the independent canonical variables 
P and Q, the hamiltonian H, and the corresponding observables, we simply 
write transition amplitudes as 


f|>= \ GP, Q) exp | ar — H(P, of (9-158) 
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In actual cases it will be generally unpractical to perform the elimination leading 
to the canonical parametrization of C/E. We therefore look for an expression of 
(9-158) in terms of the original constrained variables (p, q). To do this we rewrite 
the measure at each time in the path integral as 


[] 4P.dQe=[] dP. dQ, [] S(PQ6(Q. - Os(Q1,-.-,Qn-m Pis--->Pr)] 
1 1 


n-m+1 


Since | |j dP, dQ, is canonically invariant it is equal to [ [i dpe dqx. Furthermore, 
[]i-m+1 (Ps) = []% 5(g,), and the familiar rule on d-functions yields 


We DG aes fal 
nN Te 6[{Q, = ; Q,( Dis Me es Raee ey a I OL ful, q)] DO iameepietiss: Qn) 


From (9-156) it follows that the jacobian is nothing but det [g,, fi; which we 
write for short as det {g, f}. 
Using the integral representation 


fram = | 11 Sten (-iS a) 
1 1 
and reassembling all the pieces we find 


p= [20.02 qh TT Loa) ) detaigaf } | exp [ | aoe —h- | (9-159) 


The constraints are clearly exhibited and we recognize the action given in (9-138). 
The variables 4 occur without conjugate quantities. 


For this construction to be meaningful it is mandatory that (9-159) be unaffected by a different 
choice of auxiliary conditions g, = 0. Let us verify this point in an infinitesimal form. Consider the 
sffect of a small change 


Gx + 6G, = 0 l1<k<m (9-160) 
The linear system 


m 


8k = om Svs { foi} (9-161) 
admits a unique solution by virtue of (9-155). This means that 
Og, ~ {OF, gx} OF =) bv, f, (9-162) 
The corresponding oF generates a canonical transformation 


p>p+op q7>q+6q dp={5F,p} d6q={6F,q (9-163) 


leaving the measure IT dp dq invariant. Remembering Eq. (9-153) it follows that oF also generates a 
nonsingular linear transformation on the constraints 


f>(1+ bA)f (9-164) 


with the matrix 6A depending in general on the point (p,q). Finally, the action § dt(pq —h) is at 
most modified by boundary terms required to take the new boundary conditions into account. 
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If we rewrite (9-159) after integration on A: 


(f|D= {20 9) T] [5(f)5(g) det {9, f}] exp [ | ex - in| 


.we see that all quantities are defined modulo a function of ./, owing to the presence of 6(f). We 
then apply the canonical change (9-163); using 


T] o(f) > det (1 + 54)~! TT 5K) 
k k 


we see that 


IT [5(f.)5(gx)] det {g, f} + det (1 + 6A)~* T] [5(A)5(ge + dgu)] det {9 + 59, f+ Sf} 


since the differenves dg - [OF,y! and their Poisson brackets with the f and f+ of vanish on C. 
Finally, 


det {g + dg, f+ df} = det (1 + 5A) det {9 + dg, f} 


In summary, 


I] [5(f.)5(gx)] det {9, f} > I] [5(f.)5(gx + dgx)] det {9 + 5g, f} (9-165) 


Therefore we have proved that the path integral (9-159) is indeed independent of an infinitesimal 
Variation in the auxiliary conditions up to boundary terms in the phase. Can the reader explain the 
precise role of these boundary terms equal to exp [i[p@dF/ép]/}? Can one generalize the arguments 
to time-dependent auxiliary conditions? 


9-3-2 The Electromagnetic Field as an Example 


To get acquainted with this quantization method let us return to the electro- 
magnetic field coupled to a c-number external conserved current, with its action 


2 we 
= {atx (ES® — par sia] 


Instead of using only the potential as a dynamical variable we choose the so- 
called first-order formalism, with fields and potential as primitive entities. We 
rewrite I as 


2 2 


p= | ats | -E-(va? + f)- Broo a+ BSE pas ja] (9-166) 


This would reduce to the previous expression if we were to replace E and B in 
terms of A° and A. Varying the action with respect to the fields we recover the 
relations between field and potential 


E=—(VA4°+A) B=curlA (9-167a) 
which imply the first set of homogeneous Maxwell equations 
curlE+B=0 div B=0 (9-167b) 
The variation with respect to A yields the second pair of equations 


divE=p curlB-E=j (9-168) 
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Note that 
0 


6 A(x) 


Among these equations B= curl A and div E = p appear as constraints. The 
first one may be simply solved by replacing B everywhere by curl A which 
amounts after a partial integration to rewriting the action as 


2 
es [as jE - E sail +i + A%(div E — pl (9-169) 


[a y B(y)- curl A(y) = curl B(x) 


2 


If this is compared to (9-138) we realize that A° plays, as did 4, the role of a 
Lagrange multiplier without any conjugate variable. We are thus led to identify 
the canonical variables (p;, g;) with A(x) and E(x) respectively. Poisson brackets 
are defined through 

{Ai(x), E,(y)} = 65° — y) (9-170) 


It is necessary to generalize the compatibility equations (9-152) to cope with the 
case where p depends on time. Those are now written 


OF 
F ~ Oi eevee aaa (9-171) 
where the time derivative operates on the explicit time dependence. In the present 
case, F = div KE — p and 


The condition (9-171) reduces to 


2 Ay? ; 
~ 244 fary[EAGEe ja | ave pl = - 2 _ divj=0 (9-172) 


an identity by virtue of current conservation. Thus, apart from this slight 
generalization, the case at hand can be cast into the framework discussed in the 
preceding subsection. 

It remains to choose auxiliary conditions g(E,A)=0. This is, of course, 
arbitrary to a large extent. A serious simplification will follow if g is linear in 
the dynamical variables as is the constraint div E — p = 0. Under such circum- 
stances det {g, f} will be independent of the variables and hence may be absorbed 
in the normalization of the path integral. A possible condition is 


6, A" = A° + div A = c(x, t) (9-173) 


involving an arbitrary function c. 
We return to covariant notations and write the transition amplitude as 


[oe A) ial 6(0: A — c)exp | [es eg 4F y(O"A” — 8A") — j- ay} 


(9-174) 
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The gaussian integral over B realizes automatically the substitution B—> curl A. 
Similarly, the integral over E substitutes for the electric field its value — (VACA). 
When this is done we have for the amplitude 


| BWA) FOO- A — CBR = [es [4(@,A, — a,A,)("A" — O°A") + jah 


(9-175) 


This is not quite the expression used in the previous chapters. However, since 
(9-175) is independent of the arbitrary function c(x,t) we may complete this 
identification by integrating on these functions c with a weight 
exp [—(i//2) } d*x c?]. If we denote by F,, the quantity 0,4, — 0,A,, the final 
version of the path integral is the familiar one 


ie exp {= as E + : (6+ A)? +i} (9-176) 


This presentation illustrates the arbitrariness implied by gauge invariance. It is 
also worth comparing it to the operator formalism. 


9-4 LARGE ORDERS IN PERTURBATION THEORY 


9-4-1 Introduction 


Functional integrals provide us with new tools to investigate numerous aspects 
of field theory. As an illustration we close this chapter with a discussion of the 
behavior of perturbation theory at large orders. One of the goals of this study 
is of course an attempt at a better understanding of the amazing accuracy of 
the successive approximations in quantum electrodynamics and the other promis- 
ing weak coupling models. Another motivation lies in the hope of overcoming 
the very limitations of the perturbative series and of coping with strong coupling 
phenomena. Even though present-day knowledge is far from being satisfactory, 
interesting developments have occurred which justify this endeavor. This will also 
afford us the occasion to introduce some useful techniques in dealing with path 
integrals. 

The nature of the perturbative series is related to the analytic properties of 
Green functions in terms of the coupling constant in the vicinity of zero. Such 
a study is possible but extremely difficult. 

Fortunately there exists a less rigorous, but manageable approach which 
leads to similar conclusions, namely, that the perturbative series, in all cases of 
interest, is strongly divergent. In spite of this fact it may be very useful, as will 
be explained below. 

Thus the problem may be divided into two distinct parts. First, we look for 
an estimate of large orders in the expansion of Green functions as given by a 
well-defined set of Feynman rules and renormalization prescriptions. Some aspects 
of this program have been completed. It is noteworthy that this is independent 


464 QUANTUM FIELD THEORY 


of the question whether or not the series defines a unique mathematical quantity. 
For certain practical purposes this first step might well be sufficient. For instance, 
it could happen that quantum electrodynamics, as it now stands, is not completely 
consistent, but that power series in ~ are nevertheless asymptotic within the 
context of a deeper theory. 

The second aspect of the problem is indeed to try to reconstruct from these 
expansions a unique theory according to some definite prescriptions. This 1s 
clearly a formidable task which requires some independent information. The 
latter has to be provided by a different construction. Nevertheless, we might 
discover some hints in the structure of the series which suggest a sensible way 
of summing it. 

The typical nature of the divergence may easily be understood on a simple 
example. For this purpose we replace path integrals by ordinary ones such as 


+o 


1 
Z(g) = ml. 


Of course, this case is trivial enough that we might obtain Z(g) in closed form. 
The point g = 0 is an essential singularity. When Reg becomes negative (and 
| Arg g| <2) we may still rotate the integration contour in (9-177). This is no longer 
possible when g approaches the negative real axis and the integral blows up for 
large ~. We may think of @ as the value of a field at a point with g?/2 + g@* 
as a caricature of the action. Negative values of g correspond to an unstable 
situation where the “potential!” is not bounded from below. This is reflected in the 
perturbative expansion if we write 


do e712 +99") (9-177) 


Z(g) = » g'Zy 


aie + 00 4k ee T(2k 1 
oe | do = oe (9-178) 
aff Lie —x e Vv mk 


Using Stirling’s formula 


k! ~ ./2nk ekink—-k 


for large k, we find that Z, behaves as 


_14)\k 
Zi ~ ia e(k—1/2)Ink—-k (9-179) 
\/ Tl 
Nevertheless, the power series in g is asymptotic in the complex g plane cut 
along the negative real axis since 
qnt 'T(Qn =e 3) lg|"*? 
s/n(n + 1)! [cos (2 Arg g)}?"* >? 


meaning that for fixed n and g small enough the right-hand side may be made 
arbitrarily small. 


The asymptotic behavior of Z, was obtained by applying Stirling’s formula 


< 


(9-180) 


Z(g) — x Zig 
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to the exact expression. The latter will not be available in more realistic cases. 
However, this suggests to apply the method of steepest descent for large k to 


ee |e e = 2/9 E 
n= ST Bi, ao (9-181) 


The position of the saddle point is 


p2 = 4k (9-182) 
and integration over quadratic deviations from ¢, yields 
(=a 2 Oe 

Var ie e2kin4k—2k — ea e(k—1/2)Ink—k (9-183) 


as before. 

What can we do from such a divergent series besides using it in an asymptotic 
sense to evaluate the function for small g? In some fortunate circumstances, 
such as the one discussed here, the power series (9-178) contains in fact enough 
information to reconstruct the function unambiguously. Of course, it might be 
argued that we could add any function such as exp (= 7) with vanishing 
derivatives at the origin along the real positive axis. However, within a well- 
defined class of functions excluding these pathologies [and to which Z(g) belongs | 
a Borel transformation enables us to recover Z(g) from its divergent perturbative 
series. 


Introduce 
5 
Bit) = 9-184 
O=d rees ae 


where the choice of I’(k + 3) is justified by the growth given in Eq. (9-179) and 
could be slightly modified in analogous cases. The series on the right-hand side 
in (9-184) will converge within a circle of finite radius in the complex t plane. 
If B(t) may be continued along the entire real positive axis and does not grow 
too fast at infinity, then Z(g) will be given by 


Z(g) = | _ dt t'/?B(gt)e™* (9-185) 
0 


Knowledge of the perturbative series is insufficient to prove Borel summability. 
It is, however, sufficient to disprove it in the case where B(t) would be found 
to have a singularity on the positive real axis. This happens, for instance, if 
asymptotically the Z; have equal phases. 
For our simple example we find 
2 
i) ea 
ii (9-186) 
eo. — | 
= 
eee: tI 


and B(t) is analytic in a plane cut from —7’¢ to — oo. 
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To obtain a convergent expansion for Z(g), let us map the cut t plane onto a 
circle, keeping the origin fixed (this is given here by the choice of variable uw), 
and derive the convergent Taylor series for B(u): 


B(t) = Bu) = ¥ b[u(s)] (9-187) 
0 
from the knowledge of its expansion in t. Then 


Zay= 0 | at t![u(gt)]te~* (9-188) 
0 0 


Since b, decreases here as k° >’? it is easy to see that this new series will 
converge as e~ 3(/39"")"*, 

The behavior exhibited in this example bears a close relationship with some 
divergences encountered in field theory since the expansion of Z(g) in powers 
of g has coefficients equal to the number of vacuum diagrams in a y* model. 
This follows readily from the fact that Wick’s theorem applies to integrals of 
monomials over a gaussian weight. 


This remark applies to other field theories as well. In electrodynamics, for instance, consider the 
integral 


ZG, i, n) = u aa dp diy e~ 47/2-V—eA +n + AW +A 
zt 
(9-189) 
= GA A prinill—eA)+jA 
ie 


where w and W are considered as complex conjugate c-number variables. The expansion of In Z 
in powers of e generates the number of diagrams of the connected functions, except for the 
cancellations implied by Furry’s theorem. These are implemented by symmetrizing the generating 
function of charged loops with respect to e, In(1 — eA) >41n(1 — e? A?), that is, by replacing Z by 
ZG. fi, n) = | as eA’ 2+ini( —eA)+jA (9-190) 


J/1— eA? 


The integral is meaningful for negative e?. For the photon and electron propagators G and S, related 
to the vacuum polarization @ and self-energy 2, we find respectively 


G=(1—@)"* = <A”) 


where the average is over the measure (dA/,/1 — e?A4*)e~*"/?. Surprisingly these expressions 
coincide: 


(9-191) 


d 
G=S= -22|1 +; Ko 22 = (9-192) 


with K(z) the modified Bessel function 


Ko(z) = | dé ez cosh 6 


iv) 
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Expansion of (9-192) for large z yields 


G=S=1 +e? + 4e* + 25e® + 208e8 + 2 146e!° + 26 368e!2 4 --- 


= 9-193 
@® =X =e? + 3e* + 18e° + 153e8 + 1638e! + 20 898e!2 +--- Cle 


to be compared with the number of diagrams for the vacuum polarization with one charged loop 
* only: 


@, = Y (2n — 1)!! e?" = e? + 3e* + 15e° + 105e® + 945e!° + 10 395e!2 + «-- (9-194) 
Similarly, the generating function for vertex diagrams I is equal to 
T = 4z(1 — $)S~?G~* = 1 + e? + 7Je* + 7206 + 891e® + 12 672e!9 +. --- (9-195) 


Extremely courageous people are undertaking the computation of the 891 diagrams of the electron 
anomaly to eighth order, but will anybody ever dream of considering the 12 672 ones to tenth order! 


9-4-2, Anharmonic Oscillator 


Let us apply the previous ideas to the study of a quantum mechanical system. 
Although the method works for any polynomial potential we shall for definiteness 
consider the ground-state energy of an anharmonic oscillator with hamiltonian 


H = 4(p? + 9”) + go* (9-196) 


The configuration variable is denoted g and its conjugate momentum p to 
emphasize the formal analogy with higher-dimensional field theories. The problem 
of the expansion in powers of g may be considered in the framework of the 
Schrodinger equation. We expect an instability for infinitesimal negative g, which 
can be investigated by means of the WKB (Wentzel, Kramers, Brillouin) approxi- 
mation. Since this method will not be available as such in higher dimensions it is 
instructive to use an alternative approach tailored after the previous example. 

The matrix elements of the evolution operator e~ ” can be expressed as path 
integrals. So does the trace of e~*", which may be interpreted as the partition 
function of a canonical ensemble of oscillators. Here f~ * is equal to the absolute 
temperature multiplied by Boltzmann’s constant and 


= j In (Tr e-P#) (9-197) 


is the free energy. When the temperature goes to zero, or f to infinity, F reduces 
to the ground-state energy. 

Thus we may represent the partition function Z(g) as a Feynman-Kac path 
integral over the exponential of an euclidean action (with a change in the relative 
sign of kinetic and potential contributions as compared to the usual expression). 
The configurations g(t) are periodic in “time”: @(0) = g(f), to comply with the 
fact that we are computing a trace 


B +2 2 
Teel enZ (g)= | 20 exp E | dt (2 + so*)| (9-198) 


0 
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The ground state energy is given by the formula 
ems 
E(g) — E(0) = lim E In Zia) (9-199) 
Bo 
with Z(g) expressed through (9-198). 


The expansion of Z(g)/Z(0) in powers of g reads 


Z(g) 
207 29 . (9-200) 


ez: = |. = (|, ‘i 
— t ———+kIn dt 
ai 765) 2° ies 4 


For large k we evaluate Z, using the steepest-descent method. We look for a 
saddle point ¢~,(t) such that @,(0) = 9(f) and minimizing the effective action 


p 
= (. pe i, (| dt o*) (9-201) 
0 Z 0 
that is, 
Ako: 
Ge— Pe+ 75 P ___9 (9-202) 
| dt' pz (t') 
Rescaling @, according to 
a |e 
Plt) =| 75 Q(t) (9-203) 
dt' p*(t’') 
0 
we obtain the equation 
9-G+@=0 (9-204) 


expressing the fact that @ is of order unity while @, grows like k'/*. As compared 
to the usual equation of motion, Eq. (9-204) exhibits two changes of sign beyond 
the rescaling which has eliminated the magnitude of the coupling. First, the 
potential, and hence the force, has reversed its sign due to the rotation to euclidean 
time (another way to put it is to replace t by it so that the acceleration is 
reversed). Second, the effective coupling is negative as shown by the relative 
sign between the harmonic (#) and anharmonic (@%) contributions. The effective 
potential is depicted in Fig. 9-6. The equation is translationally invariant in time 
and admits the symmetry @ > — @. 

By relabeling t we may use a symmetric interval in time [ — f/2, 8/2] instead 
of [0,8]. In the limit 8 —o the interval is infinite and the solution which 
minimizes the action is found to be 

/2 


Qt) ost cosh (t — 2) 


(9-205) 
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Figure 9-6 Effective potential in 
euclidean space V(@)= —@?/2 + 
@*/4. The limiting motion is shown 
in dotted lines. 


up to a global sign and with an arbitrary origin in time t. Clearly this expression 
fulfils the periodic boundary conditions which are here supplemented by the 
condition that the action be finite. We thus have an infinity of degenerate 
saddle points satisfying 


+ 00 pt) + = 4 +00 oo 
Q Sou 4 - gp 2 
dt —= = —— 9-2 
| 5 2 | dt 7 3 | dt 3 (9-206) 


and 


B «2 2 B 
I= lim || EO bin | iv ot) |= —k In (3k2) + 2k (9-207) 


0 


gee 1225 (9-208) 


Due to the degeneracy of the saddle points which follow from the symmetries 
of the problem, the integration over quadratic deviations from the minimum 
requires some care. It is proper to call it a quantization problem since we look 
for the quantum frequencies of oscillations around a classical extremum of the 
action. The question will arise whenever a classical saddle point will be the 
starting point of an approximation to a path integral and degeneracies arise due 
to a continuous invariance. One or several modes will be of zero frequency. 
Here they correspond to time translations and should first be isolated to yield a 
factor B after integration over t. It is, of course, crucial to keep f finite, albeit 
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large, even though it may be taken infinite for those quantities which admit a 
finite limit. The problem bears a close relationship with the quantization of con- 
strained systems studied in Sec. 9-3. 

Thus we want to treat separately the collective mode arising from translational 
motion. To do this we introduce in the path integral a factor unity in the form 


B/2 
{= | dt 5(0y — 7) (9-209) 
=U 


where 6, is defined implicitly by the condition 


{a W(t — O,)p(t)=9 (9-210) 


and the function w is the normalized derivative of @ corresponding, say, to 
Tie 


W(t) = a i (9-211) 
dt é*) 


The reason for this choice will soon appear clear. 

If ¢ is translated by an amount t through g(t) > @.(t) = @(t — 7), this leaves 
the limiting action as well as the path integral measure invariant but 6, is changed 
into 6, + t. Thus the integral over t can be performed explicitly and yields, as 
expected, a factor f. Then 


(—1)* Saif 1 


ae a. a0) 


| Dos(Oy)e7t-® (9-212) 


The 6 function will select a unique saddle point except for a sign ambiguity. 
Expanding around one of the two solutions (the positive one, for instance) is 
allowed provided we multiply by a factor two, since from symmetry the two 
contributions are equal. We set 


Y= Pe TX (9-213) 


and keep the dominant terms in the expansion in y. It then follows that 


0(9o) = | [a Wo+y) | a( | a vn) (9-214) 


To leading order we drop J dt Wy and observe that 


_ {a Wo. = a (| a a = Sk (9-215) 


while I — I, to second order in y reads 
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I-I==| jdat|?+y77- 
5] Jae tae) + (far) | 


With Dirac’s bracket notations we find 


(9-216) 


(1 | 2206u10) exp [—2<y|(K +14 |w><ul)|x>] 


Ag 5 ki ea) 
. \2 exp [—3<z|(Ko + 1)|x>] 
(9-217) 
where the Schrodinger operator K is 
2 
— — =_— (9-218) 
and Ko corresponds to the free part Ky = —d?/dt*. The instructions are to 


integrate the normalized version of @ in the subspace orthogonal to w. The 
spectrum of K + | contains a discrete zero mode with eigenfunction given by w. 
Indeed, by differentiating Eq. (9-204) with respect to time and inserting the 
solution (9-205), we have 


Ee \ eG (9-219) 
ae cosh? 1)? ~ 


This does not affect the integration due to the 6 function. Moreover, since y¥ 
has one node there exists a unique normalized y corresponding to a negative 
eigenvalue of K + 1. The remaining part of the spectrum is a positive continuum. 
It may be noted from the definition of u that (y|u> = 0. 

The operator K corresponds to a new Schrodinger problem with a so-called 
Bargmann potential which admits explicit solutions. We want to compute the 
nonvanishing determinant of K + 1 + |u><u| in the space orthogonal to |y>. 

The final result for Z, will then be 


aug 
k! 
det (K + 1+ |ud<ul). 

det (Ko + 1): 
The factor 2x comes from the gaussian integration over the yw mode omitted in 
the denominator when we consider (Ko + 1),. This factor would have been 
modified had we not chosen to normalize w. The extra projector |u><u| is easily 
disposed of by noticing that 


Ze e '(2k¥/2)N 


(9-220) 


N-?2=2n 


4 
(K + 1) = —29° = ——= (9-221) 


= 


472 QUANTUM FIELD THEORY 


As noted above, since |u > belongs to the subspace orthogonal to w, this means 
that Eq. (9-221) may be inverted in this space to yield 
7 >) 


are p.(1- J3 <ol>) 


= —det(K + 1), (9-222) 


det (K + 1+ |u><ul). = det (K + »,(1 + <ul & 


The negative sign is welcome because of the remaining unique negative eigenvalue 
of (K + 1),. In practical terms, to separate the contribution of the transverse 
modes we may use a limiting procedure by replacing K +1 by K +z, and 
extracting the coefficient of (z — 1) as z 1. Thus 

det(K + 1). _ 1 det(K +z) 


—2nx lim 


=—2 a 
| Sig Fa era a 7 — 1 det (Ko + 2) 


(9-223) 


Several methods may now be used to obtain the explicit expression of this 
Fredholm determinant. The most instructive, which may be generalized to an 
arbitrary one-dimensional potential, would be to relate the evaluation of fluctua- 
tions around a saddle point in the path integral and the ordinary WKB approxi- 
mation for wave functions..We can also note in this special case that the 
fluctuation problem reduces to a soluble equation. Both methods have been used 
in the literature to obtain 


det(K+2) T(t +./z)r(Vz) 


9-224 
det (Ko +2) (34 /DE (Yt - a a 
from which we derive 
N72 = 2 (9-225) 
Putting all factors together results in 
_1)\k 
Li = B (-1f Bk1/2 6 ek In (3k7)—2k (9-226) 
k! T™ 
As expected this yields a k! increase as 
6 1/2 
= pI(k + 4)(— (5) (9-227) 


Due to this wild behavior it is easy to translate this result in terms of the 
expansion for the energy. Indeed, the leading contribution of the coefficient of 
g* in the series for log(1 + )’ cpg”) is [1+ O(1/k)] when the cq grow like 
k!. Thus, applying formula (9-199) we find that the factor B drops out and we 
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are left with 


El)= 44+ 46 


1/2 
B= (1rd tya(S) [1+ 0(2)| 


a result first derived by Bender and Wu. 

In the case of the anharmonic oscillator, Graffi, Grecchi, and Simon have 
been able to prove that the function E(g) is Borel summable, ic., may be 
represented in terms of its Borel transform by a relation of the type (9-185). 

We could then proceed to systematic corrections in powers of 1/k using the 
standard perturbation techniques around the saddle point. The method may also 
be extended to excited states by keeping terms of order e~* in the saddle-point 
action and expanding e™ “ in powers of e~°. 

As a final remark we observe that when looking for periodic trajectories 
of period f we have other solutions with fractional periods f/2, B/3,..., which 
would also lead to saddle points. These have a classical action two, three,..., 
times larger than the lowest one; thus they contribute only exponentially small 
corrections to the results. 


(9-228) 


The extension of these methods to field theory is in principle straightforward, at least insofar as 
renormalization is left aside. We look in a similar way to instabilities occurring for small coupling. 
These are responsible for singularities in the Borel transformed plane. As long as they do not reach 
the positive real axis (assumed to correspond to the physical situations) the theory is sound. This 
extra information enables us to use in the most effective way the first few terms of the perturbation 
series for an accurate determination of physical quantities. This program has encountered a greai 
success in some applications to statistical mechanics of the bosonic ~* model in three dimensions. 
It can be generalized to include fermionic fields. Singularities on the positive real axis may, however, 
be also encountered which originate from actual instabilities even at the semiclassical level (such is 
the case in the theory of gauge fields presented in Chap. 12). By this we mean that to develop a 
perturbation series we have to select one among several degenerate minima of the classical energy. 
The divergence of the series then reflects the quantum tunneling between these ground states. 
Classical euclidean solutions to the field equations with finite action J interpolate between these 
states and contribute a transition amplitude e ~!/". To construct a meaningful theory it is then 
necessary to remove this degeneracy by introducing additional quantum numbers. 

It is also possible that extra singularities occur as a result of renormalization. These seem 
to raise difficult problems about the consistency of renormalizable field theories. 


NOTES 


The idea of introducing path integrals via the superposition principle in quantum 
mechanics originates in the work of P. A. M. Dirac, Physikalische Zeitschrift 
der Sowjetunion, vol. 3, p. 64, 1933, and was developed by R. P. Feynman, Rev. 
Mod. Phys., vol. 20, p. 367, 1948. An expanded version is found in the book by 
R. P. Feynman and A. R. Hibbs, “Quantum Mechanics and Path Integrals,” 
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McGraw-Hill, New York, 1965. The treatment of fermions was given by F. A. 
Berezin in “The Method of Second Quantization,” Academic Press, New Youk, 
1966. Among the numerous recent presentations we may quote the lectures of 
J. Zinn-Justin at the 1974 Bonn Summer Institute, edited by H. Rollnik and 
K. Dietz, “Trends in Elementary Particle Theory,” Lecture Notes in Physics 
No. 37, Springer-Verlag, Berlin, 1975, and those of L. D. Faddeev, Les Houches 
Summer School 1975, “Methods in Field Theory,” edited by R. Balian and 
J. Zinn-Justin, North-Holland, Amsterdam, 1976. The approach through func- 
tional derivatives has been developed by J. Schwinger, Phys. Rev., vol. 82, p. 914, 
1951. 

For a discussion of the eikonal approximation in field theory see, for instance, 
H. D. I. Abarbanel in “Cargése Lectures in Physics,” vol. 5, edited by D. Bessis, 
Gordon and Breach, New York, 1970. 

The effective potential is discussed in the paper by S. Coleman and E. Weinberg, 
Phys. Rev., ser. D, vol. 7, p. 1888, 1973, and in the Erice 1973 lectures of S. Coleman 
entitled “Secret Symmetry” in “Laws of Hadronic Matter,” Proceedings of the 
Eleventh Course, International School in Physics “Ettore Majorana,” edited by 
A. Zichichi, Academic Press, New York, 1975. The calculations quoted in the 
text are described in J. Iliopoulos, C. Itzykson, and A. Martin, Rev. Mod. Phys., 
vol. 47, p. 165, 1975. 

P. A. M. Dirac gave the first general discussion of the quantization of con- 
strained systems, in Proc. Roy. Soc., ser. A, vol. 246, p. 326, 1958. It was elaborated 
in the context of path integrals by L. D. Faddeev, Theoretical and Mathematical 
Physics, vol. 1, p. 1, 1969. 

Vacuum instabilities and their relation to the divergence of the perturbative 
expansion were suggested by F. J. Dyson, Phys. Rev., vol. 85, p. 631, 1952. An 
early study of Borel summability was made by A. M. Jaffe, Comm. Math. Phys., 
vol. 1, p. 127, 1965. It has been proved for the anharmonic oscillator by S. Graffi, 
V. Grecchi, and B. Simon, Phys. Lett., ser. B, vol. 32, p. 631, 1970. Formulas 
for the anharmonic oscillator were obtained by C. M. Bender and T. T. Wu, 
Phys. Rev., vol. 184, p. 1231, 1969; Phys. Rev. Lett., vol. 27, p. 461, 1971; and 
Phys. Rev., ser. D, vol. 7, p. 1620, 1973. Quantization around a classical extremum 
is discussed in R. F. Dashen, B. Hasslacher, and A. Neveu, Phys. Rev., ser. D, 
vol. 10, p. 4114, 1974. 

Large-orders estimates in field theory are studied by L. N. Lipatov, JETP, 
vol. 45, p. 216, 1977, and E. Brézin, J. C. Le Guillou, and J. Zinn-Justin, Phys. 
Rev., ser. D, vol. 15, pp. 1544 and 1558, 1977. 


CHAPTER 


TEN 


INTEGRAL EQUATIONS AND 
BOUND-STATE PROBLEMS 


The purpose of establishing integral equations among amplitudes is to study non- 
perturbative properties. In particular we wish to find a suitable framework for 
the investigation of relativistic bound states. We shall concentrate our attention 
on this aspect leaving aside other interesting applications. The formalism is 
described in some detail and is illustrated on the example of the hyperfine splitting 
of positronium. 


10-1 THE DYSON-SCHWINGER EQUATIONS 


Using different techniques, Schwinger and Dyson have derived independently 
integral equations for Green functions as a consequence of the field equations, 
ie., the specific structure of the dynamical lagrangian. This was already en- 
countered in Chap. 9 for a self-coupled scalar field. 

When properly renormalized these equations may be used as an alternative 
approach to perturbation theory. A closer look reveals that the system involves 
an infinite hierarchy of equations. Consequently, apart from the discussion of 
general properties their usefulness is limited by the approximations needed to 
bring them in manageable form. This unfortunate circumstance may be traced 
to the fact that the natural mathematical tool in this context is the functional 
calculus, a not too familiar instrument. 
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10-1-1 Field Equations 


For definiteness we shall deal with electrodynamics, but the technique is general. 
We introduce as usual a generating functional with a source J,(x) for the electro- 
magnetic potential and anticommuting n(x) and n(x) sources for the electron- 
positron field. A path integral expression for this functional is 


ZI, 0, 9) = e0™? = | au, WW) e 
(10-1) 


Ce | dx [JuxAMx) + COW) + HINO] 


Here J is the action as a function of A, w, w given in (6-25), and a normaliza- 
tion factor is understood to insure that the connected functional, where we omit 
the index c for shortness, satisfies G(0) = 

To cope with renormalization, the action may be regularized and counter- 
terms included. For the sake of simplicity these will not be explicitly spelled 
out until the end of the derivation. 

Field equations will follow from the observation that the integral of a derivative 
vanishes. We have, for instance, 


0= | 914, 1D) ot a 49 | gia 


66 (1 6 6 ; 
=|. (a ion’ -z.)+ Jo) [20m (10-2) 
Write 


e 


I 4 a 
vue = [Ogu — (1 — 4)0,6,] A” — edvy ap (10-3) 


Then Eq. (10-2) reads 


6G 6G 6 OG 
v— ( 0n0 —y —e—ly, —]= - 
x)+ [Og, — A)é,0,| ——-~ =, i —e egg “ (> =) 0 (10-4) 
Itis advantageous to perform the Legendre transformation to irreducible functions 
GJ, n, f) = (A, WW) + i [es (J-A+ yn + A) (10-5) 
satisfying 
1 6G 1 6G = 1 “6G 
A,(x) =-= x) = —- —— a2 
1) 2 2 te 
‘ 5AM) buy BUG) 
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Inserting these definitions in (10-4) and recognizing that 


— bap 0*(x — nat lars <— a 
OES i dna(x)OH,(2) dW, (z)dW p(y) 


(10-7) 


a 
=0 


we may reexpress it when the fermion sources vanish as 


a = [Dow — (1 — Aa,a,) A"(s) — | (= ar »| 
PAA ee0 Guv — 1 (as iestr | yy Swow bas! (10-8) 


The inverse of 5°T/dwW(x)dW(y) is the electron propagator including radiative 
corrections in the presence of an external field. We take one derivative with respect 


to A and set it equal to zero. In this process we encounter the irreducible vertex 
function (7-46) 


6 6-r 
SAM) SYNE) 
the vacuum polarization amplitude 
oe O 
5A*(x)5A”(y) 


= eA, (x;y; 2) (10-9) 
Aw P=0 


me [Ogu a era 4)0,0v] 54(x — y) + (gp — 0,0) @(x — y) 
AwW=0 
(10-10) 


and the complete electron propagator iS(x, y), satisfying the relation (Fig. 10-1) 
(G..01 — dyna, y)= we? [ated 425 tr [YuS(x, 21) Av(y3 21, Z2)S(Z2, x)] (10-11) 


The normalizations are such that to lowest order 


A\(y3 21, 22) = prO*(y — 21) O4(y — 22) + °° 


- d*p oe tp: (x—y) 
Sto= | ae 


In other words, Eq. (10-4) is a sophisticated form of Maxwell’s equations of 
which (10-11) is only a small part. Higher functional derivatives would lead to 
further relations. The graphical interpretation given in Fig. 10-1 could, of course, 
be used for a direct derivation. Similarly, Dirac’s equation will follow from the 


(10-12) 


Figure 10-1 Integral equation for the vacuum polarization. 
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identity 


o= {2 i.) ee 76 ae fe" 


= ja (z. a - 2) +09 zaman (10-13) 
dy(x) \toJ dn idn 


It is convenient here to work with complete Green functions without disconnected 
photon amplitudes, obtained as derivatives of Z(J,n, )/Z(J, 0,0). For shortness 
we write Z(J, 0,0) = Z(J) and obtain 


i 


5 6 
> a ee Zs; ’ = 0 (10-14 
nc + (i ies arg al aaa | 


We take a derivative with respect to 7 and set yn = 7 = 0 


d*(x — y)ZU) — E —m— ey" srs | Z(J)S(x, y; J) = 


where S(x, y; J) describes the propagation in the presence of the source J. If we 
denote 


1 6 
Q =1 = 
A,(x; J) = Os ain Sy BW) = 5 5") G(J, 0, 0) 
this equation may be rewritten 
o 
CY — || Se) = . _ Lu )° = Bil 
o*(x — y) E m — eA(x;J)— e) agg [Soren 0 (10-15) 


with a Kronecker 6 function for spinor indices omitted. In (10-15) the differentiation 
with respect to J can be performed, the source set equal to zero, and as a con- 
sequence A(x; J)| =o = 0. This yields 


(i9 — m)S(x, y) — ie” | ate a*x' d*y’ yyG"(x, 2)S(x, x’) 


x A(z; x’, y)S(y, y) = O%(x—y) (10-16) 


since the complete three-point function involves the irreducible vertex convoluted 
with propagators for the two fermions and for the photon 


hy So ee itudi 3 
G(x, y) | e PTL + a] + longitudinal terms| (10-17) 


Equation (10-16) is best understood as giving a meaning to the self-mass operator 
in the form 


[Gd — m — }°)S] (x, y) = 54*(x — y) (10-18) 
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Figure 10-2 Representation of the self-mass operator. 


with 
>: (x, y) = ie? [ats ax y.G"(x, z)S(x, x)Av(z; x’, y) (10-19) 


This formula is illustrated in Fig. 10-2. 
If we set 1 = 7 = 0, we may rewrite Eq. (10-4) in the form 


[Ogu — (1 — A)6,0,] A(x; J) = —J,, — ie tr [y,S(x,x3J)] (10-20) 


In conjunction with (10-15) this yields a closed functional system where the 
argument is the source J. Alternatively, we may write an equivalent system with 
an external potential A as argument. For this purpose let G,,(x, y; A) be the 
photon propagator. Differentiating Eq. (10-20) with respect to J we obtain 


[O.gu” — (1 — 2)6,0"] Gy (x, y; A) 


+ ie [ets tr |, eee Gy lz, Y; 4) = Gupd*(x — y) (10-21) 


while (10-15) may be rewritten as 


6S(x, y; A) 


[id — ~ — eA(x)]S(x, y; A) — ie [ats VnG"(x, z; A) ee 64(x — y) 


(10-22) 


Again this is a complete set to determine S and G. These remarks are unfortunately 
of little help due to our inexperience in handling such expressions. 

Returning to Eq. (10-14) we can obtain further information if we take higher 
derivatives with respect to spinor sources. Of great interest is the Green function 
involving the propagation of two charged particles: S(x1, x2; 1, 2;J). Proceed- 
ing as before we find 


6 
E —m— eA(x1;J)— ey" aay Se X25 ¥1 Y23J) 


= 6*(x1 — y1)S(x2, yo; J) — 64(x1 — y2)S(X2, 13 J) (10-23) 
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a <r: 1G 


Figure 10-3 The kernel V of the Bethe-Salpeter equation. The slashes indicate truncation of the 
corresponding propagator lines. 


The antisymmetric combination on the right-hand side is a reflection of the 
Pauli principle. Taking into account the relation (10-18) satisfied by the two- 
point function, we may act on the variable x, with the operator i¢ — m— yy. 
(For different applications we might be interested to act on one of the two 
remaining variables y, or y2.) At any rate we find, when setting J = 0, 


(idx, Si Y)(id, = tie >) S@1, X25 Vi5 y2) 
= 6*(x1 — y1)6*(x2 — y2) — 64(x1 — y2)d*(x2 — y1) 


Sr =, lor ses 5 S(x1,¥13%2,¥23J)|s=0 (10-24) 


The first term on the right-hand side is readily traced out as the contribution 
of the disconnected amplitude. The second term may best be understood from a 
graphical analysis. Written as.a convolution integral 


[ate az V(x1, X23 21, Z2)S(Z1, Z23 V1, V2) 


it involves a kernel V(x;, X23 1, y2) described in terms of truncated four-fermion 
diagrams which cannot be disconnected by cutting two fermion lines (Fig. 10-3). 
To lowest order, and adding superscripts to y matrices to recall on which indices 
they act, 


V(x1, X25 V1, V2) = ie? YW G"*(x1 — x2) y?)54(x1 — y1)54*(x2 — y2) (10-25) 
With this notation we write the resulting Bethe-Salpeter equation in the form 
(9x, — m — )/) (ix, — m — Y)S(x1, X25 Yi, y2) 
= 0*(x1 — y1)6*(x2 — ya) — 5*(x1 — y2)d*(x2 — y1) 


a5 [ats d*z, Viet, X23 21, ZS ess 15 Y2) (10-26) 


Figure 10-4 Bethe-Salpeter equation. 
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{t is represented graphically in Fig. 10-4 where the slashes indicate amputations 
on the corresponding propagator lines. 

As it stands this equation has been written in the doubly charged electron- 
electron channel. It can, of course, be established in the crossed electron-positron 
‘channel where it is suited for the investigation of positronium bound states. 


10-1-2 Renormalization 


The field equations obviously need renormalization. To avoid cumbersome 
notations we did not introduce the counterterms in the action. They have now to 
be reinstated. As this is not our main concern here we shall not elaborate these 
points in great detail. We only sketch the modifications brought upon the equations 
by taking the multiplicative renormalization of Green functions into account 
when expressing them in terms of the physical mass and coupling constant. 
Consider, first, Eq. (10-15) for the electron propagator. In terms of electron 
wave function (Z,) and vertex renormalization (Z,) this should properly be written 


r) 
Pho ah— <ce —m)— [ yt _—__ yiJ)=0 10-27 
O"(x — y) ali m) eZ | Mo D4 aol Sx, v5 J) ( ) 


The Ward identity requires Z, = Z», but both are infinite quantities. Similarly, 
the photon propagator equation (10-21) should read 


Z3{ Dsgu” — 010” |Gy (x, y) + 10,0" G, (x, y) 


+ iZ,e | ate tr E ok) G, zs »| = Gupd'(x — y) (10-28) 
6A”(z) 
implying again the infinite renormalization constants necessary to obtain finite 
results when iterating these equations for the finite Green functions G and S. . 
This is a rather unpleasant situation, except if we return to the perturbative 
expansion. 


Exercises 


(a) Derive the renormalized forms of Eqs. (10-11) and (10-19). ee 

(b) Relate the vertex function to the Bethe-Salpeter-kernel and examine the renormalization properties 
of Eq. (10-26). a 

(c) Obtain the equivalent equations in the ¢* model and discuss their renormalization properties. 


10-2 RELATIVISTIC BOUND STATES 


In the relativistic approach, bound states and resonances are identified by the 
occurrence of poles in Green functions. A simple extension of the Schrédinger 
equation is unfortunately not available, except in limiting cases such as those of 
static external sources discussed in the framework of Dirac’s equation. 
The general difficulty is in fact twofold. At first we have to face retardation 
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effects which introduce an extra relative time variable in the problem. An alter- 
native description uses the mediation of a field. But the quantum aspects of the 
latter cannot be ignored. Thus it appears that the mere concept of a two-body 
bound state, for instance, is only an oversimplification of the actual situation. 
In spite of various artefacts -which may be of great practical importance—it 
remains true that we have to return to the general field theoretic framework 
whenever we need a fundamental description. This is also the case if we want 
to introduce further radiative corrections. 

A great amount of work has been devoted to the subject of which only a small, 
but hopefully representative, part will be surveyed here. We recall that we have 
already discussed aspects of hydrogen-like bound states in Chap. 2 and computed 
the Lamb-shift corrections to lowest order in Chap. 7. 


10-2-1 Homogeneous Bethe-Salpeter Equation 


Instead of dealing with the full complexity of the spinor problem, we shall examine 
for the time being a simpler model of scalar particles interacting through the 
exchange of another type of scalar particles. This is, of course, a theoretical 
exercise in order to exhibit some of the features of the real problem. The kernel 
V has also to be truncated. Moreover, we shall ignore the effects of statistics by 
assuming that the two “charged” particles are of a different kind. 

In symbolic notation Eq. (10-26) may be rewritten 


$42 = sMg@ 4° gg) ygu2) (10-29) 


where S") stands now for the complete propagator of particle 1, soon to be 
approximated by a free propagator involving the physical mass, and renormaliza- 
tion effects have been discarded. Defining — D as the inverse kernel of the product 
SS) this equation could formally be solved in the form 


$02) = (DV)! (10-30) 


showing that poles may occur whenever D + V has a zero eigenvalue. We are 
thus led to a homogeneous equation describing the properties of bound states. 
To be precise we define 


SO) = CO] Te1(x1)2(x2) 91 (91) 93(y2) |O> 
say <0| Te1(x1)e4(y1) |O> 


A bound-state contribution of mass M to S‘'*) (assumed nondegenerate for 
simplicity) will be of the form 


(10-31) 


d3P ; 
| (2n)32 p2 + M2 XP(X1, X2)XP(V15 y2) (10-32) 
with 
XP(X1, x2) = <0| TP 1(X1)2(x2) |P> 


oly Ya) = <P] T@1(y)04(y2)|0) = <0| For(i)oaiv2)|P* 
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Here T denotes antichronological ordering. The generalization to a set of degener- 
ate bound states is straightforward. Approximating S" and S? by free propagators 
: d*k i 
Se Bik (x) ~y) 
|e k? — m2 + ie“ fe 


we find 
(Tie ls. + Madge ieese) + ce d*z. V(x4, X25 21, 22)xp(Z1, Z2) = 0 


(10-35) 
NeW Ya)Mlaly, + m?)(y, + m3) + |i d*z> Xp(Z1, Z2)V (21, 22; V1, y2) = 0 


Equations (10-35) will be sufficient to illustrate the type of reasoning when dealing 
with Bethe-Salpeter equations. 

In spite of some similarity with the nonrelativistic Schrodinger case, this is a 
very different type of equation. It is reflected in the larger number of configuration 
variables, in the fact that we deal with fourth-order integro-differential equations, 
in the presence of a kernel V which has to be specified perturbatively, and in 
the way the energy momentum of the bound state enters the equation. 

From translational invariance, it follows that 


8 p(x1, X2) (10-36) 


It is natural to introduce the relative space-time coordinate x = x; — x2. How- 
ever, the overall configuration variable is a priori arbitrary. We may choose two 
positive quantities 4, and 742 such that 


xp(X1 + a,x2+a)=e” 


+2=1 x1 = X + 72x 
ee ‘ 2 (10-37) 
X = 1X1 + 2X2 X2 = X — mx 


a transformation with unit jacobian, and write the reduced Bethe-Salpeter ampli- 
tudes as 


-iP+X 


Xp(x) 
Xp(%1, X2) = e ** ¥p(x) 


According to (10-33) y and x are not to be confused with wave functions 
but stand rather for generalized form factors. The normalization conditions are 
therefore not straightforward, since they involve the relative time variable xo. 
This innocent-looking question has been the subject of a long elaboration. The 
role of normalization is, of course, to provide the correct relation between the 
function and the four-point Green function. Furthermore, it is essential in selecting 
the proper solutions to Eq. (10-35). We return to the inhomogeneous Eq, (10229). 
Introducing the overall momentum of the pair (1,2) through 


Mi a 
Xp(X1,X2) =e (10-38) 


SO Ge is | {ea e!PaS2)y, + a, X2 + 43 V1, Y2) 
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the bound state and its CPT transform contribute a pole in the variable P?: 


ixp(X1, x2)Xr(V1, ya) (10-39) 


Bix Xos VG y2|P) = P? — M + ie 


with R regular in the vicinity of P? = M?. The factorization property of the pole 
residue is crucial for the interpretation in terms of bound state and has to be 
suitably generalized in the case of degeneracies. 

We iterate the equation (D + V)S“2) = —1 in the form S?(D + V)SO”) = 
—§2) insert (10-39), and use the fact that (D + V)z = 7(D + V) = 0 to compare 
the residues of both sides at P? = M?. The result is symbolically 

x(D + Vx 


lim ———=i 
P24 M2 P? — M2 


or, equivalently, the covariant expression 
il soe (D + ny = 21b, (10-40) 


where the left-hand side involves an integral over relative variables. In general 
the normalization condition depends on the “potential” V as opposed to the non- 
relativistic case. 

It is useful to have these equations also written in momentum space. Let p 
denote the conjugate variable to x. According to (10-37) we have 


P=p, + p2 P= apis (10-41) 


where p, = 4,P + p and p2 = n2P — p refer to momenta carried by the fields 
~, and @~> (Fig. 10-5). We have here no natural definition of the relative momentum 
as given by the separation of variables in a nonrelativistic motion, which results 
from the specific choice y;,2 = m,,2/(m, + mz). In the relativistic case this choice 
may, however, be a good candidate for the purpose of comparison. Using the 
same symbols for the Fourier transformed quantities and taking translational 
invariance into account, we have 


Xp(p) = |e “*Xp(x) d*x 


[iP + p)? — mi} [(n2P — p)? — m3] yxr(p) + \<2 eT VO, p> Ppp) 0 


(2n)* 
(10-42) 
p)[(mP + py? — mt] [(n2P — p)? — m3] + (oe One xr(p)V(p', p; Py= 


The exchange of a scalar particle of mass » and coupling constants g, and g» 
to particles 1 and 2, that is, such that the corresponding interaction lagrangian 
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Py =1P—p 


Figure 10-5 Homogeneous equation for the bound-state amplitude y. 


6 Se (G1 9191 + g2 P202)d, leads in the Born approximation to 


t igi 92 
V, ’ ; EF = 
PPE) = — Gray + 
independent of P. Explicitly, the normalization condition reads 


d*p d*p’ _ enh; 2 2 2 2 454 ' 
Qn) Qn)* xe(P’) ape i{niP + p) Sn iaP — py mie |(2n)"0"(p — p) 


+ V(p’, p; P)} yp(p) = 2iP, (10-43) 


With Eqs. (10-35) and (10-40), or equivalently (10-42) and (10-43), we have now 
the basic ingredients to study some definite models. 


10-2-2 The Wick Rotation 


Bound-state equations are derived in Minkowski space. In his early study, Wick 
was led to an analytic continuation to euclidean variables which was at the origin 
of the Wick rotation. It is easy to justify this procedure perturbatively, Le., 
precisely, ignoring the possibility of new singularities such as those investigated 
here. Some care is required in order to extend it to the present situation. The 
essential physical point is to insure that stability criteria are met. Of course, the 
desirability of using this trick is to bring the equation in a form convenient for a 
simpler analysis. 
Let us proceed in a straightforward manner by writing, for X = 0, 


p(x) = O(x°) f(x, P) + 6(—x°)g(x, P) 


(10-44) 
pple) = Ox )g*(x, P) + 6(—x°") f *(x, P) 
with 
f(x, P) = 0] @1(n2x)@2(—mx)|P> = | ee ass (02 2) 
ae (10-45) 
g(x, P) = <O| p2(—11x) G1(N2x)|P> = Fa Sm GD: P) 
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Using 
(eae = (10-46) 
2in @ +i 
we obtain y and y¥ in momentum space 
1 cACTS Pp, P) 1 9(4o, Pp, P) 
os AN oe 
XelP) 2in | ea p°—q° + ie i. eon p° —aq?— ie 


(10-47) 

a 1 Sips (Go. p, Sle 1 9* (qo; Pp, P) 
dqo So) EC oon ap dqo 0 0 

po—qot+ ma 217 p —q" SMe 
The relation between y and { is such that their discontinuities are conjugate. 
A rotation to the imaginary axis will be allowed provided fand g have a suitable 
support in the variable go. This follows from their definition by inserting inter- 

mediate states 


f(g, P) = | d*x e * Y 0] 910)|n><n| 2(0)| P> e-»—mP)-= 


s=™@! a 


(270)*6*(q — Pa +1 P)<0|91(0)|n><n| @2(0)| P> (10-48) 


g(p, P) = > (2x *5*(q + Pa — n2P)<O|@2(0)|n><n| @,(0)| P> 


The stability of particle 1 requires in the first expression pe 2m}, p? > OMPhis 
means that f(g, P) vanishes totes (miP + 4) => mj, 71 Po + qo > 0. Likewise in 
the second expression p? > m3, p? > 0, hence g(q, P) vanishes unless (y2P — q)* > 
m3, 2Po — qo > 0. In the representation (10-47), the integral of f (or f*) extends 
from @, to +00, the one of g (or g*) from — co to w_, with 


ll 


/mi + (mP + p)’ — Po 
wo = —,/m) + (y2P — p)? + n2Po 


M+ 


(10-49) 


A rotation to the imaginary axis without encircling singularities will be possible if 


w- <a. For a stable bound state Po < ./(m, + m2)? + P?. If we choose the 
center of mass frame P = 0 and n1,. = m,,2/(m, + m2), then 


m,Po M2Po 


>O0 and = w. =———-—,/m3+p*?<0 


ame. 2 
wo. =./mp+p es 
m, + M2 m, + m2 


Consequently, we have a gap between the two cuts in the integral representa- 
tions (10-47). 

We may now return to the Bethe-Salpeter equation and perform the Wick 
rotation using these results. For simplicity we retain the center of mass frame 
and the previous assignments for 7,;,. Furthermore, we limit ourselves to the 
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so-called ladder approximation, ie., the kernel V is evaluated in the Born approxi- 
mation 


19192 
oS ae : 
(p a py = ie a ig (10 50) 


' The equation takes the form 


doa Pome - 2 17 "iti P : 2 ) 
2 =p" =i —— = P= _p— 
m tm ° si P ' ime © Po p* — m2 | xXP(p) 


igG2 | iy —__20) 


(2x)* papy oie ) ) 
Consider the analytic continuation p° + p® e”® with @ varying from 0 to 1/2. For 
the first term in (10-51) we use the representation (10-47) and examine separately 
the cases p° > 0 and p° < 0. Using the properties w. <0 <4 we see that no 
singularity is encountered in the process of analytic continuation. For the second 
term we proceed to a simultaneous rotation p® > p® e!®, p'° = p’° e® and for the 
same reason we do not encounter any singularity from yp(p’) nor from the 
denominator. Using abusively for 6 = 2/2 the same notations for the functions 
of the euclidean argument the final result assumes the form 


A 2 ° 2 
ny 2 2 1m2 2 2 
(0 m, + M2 | “ | | (» m, + M2 | : fet 


9192 righ xXP(p’) 
— d*p' ——_—..—_, = 0 (10-52) 
| Po —pye te 


The metric used in (10-52) is the euclidean one (p — p’)? = Yj (pi — pi)”. 


The extension of the preceding method to higher kernels requires a more elaborate analysis in several 
variables. A similar study may also be carried out for the scattering case where part of the integration 
contour remains pinched between the two cuts. We leave it as an exercise to the reader to examine 
these cases. 


10-2-3 Scalar Massless Exchange in the Ladder Approximation 


We present the solution obtained by Wick and Cutkosky for the equation (10-52) 
when «? = 0 as an illustration of the methods and problems encountered in 
studying the relativistic bound-state equations. This choice is not completely 
arbitrary from the physical point of view since in spite of the oversimplifications 
resulting in part from the neglect of spin (particularly of the exchanged field) 
the situation has some analogy with the realistic cases such as positronium. 
Furthermore, it allows an almost analytic solution and exhibits peculiarities 
which may arise in more complex models. 

The set of ladder diagrams looks at first as a natural generalization of non- 
relativistic potential theory. However, such an approximation overlooks essential 
features of a relativistic quantum theory. By neglecting the crossed-ladder dia- 
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grams, it violates (s—u) crossing symmetry. It implies a correspondence between 
the proper time ordering of the relativistic interactions, thus mistreating the 
relativistic static limit when one of the masses gets large. This is a severe limitation 
when dealing with realistic problems. For instance, in electrodynamics, unless we 
find some compelling reasons to use it in a particular gauge (the noncovariant 
Coulomb gauge for instance), this approximation will not be gauge invariant; 
nor will it satisfy the reasonable criterion of reducing to the Klein-Gordon (or 
Dirac in the spin 4 case) equation when one of the particles becomes very massive. 


To fulfill this criterion it is necessary to include at least the set of crossed-ladder diagrams leading 
to an infinite kernel V. Find the corresponding approximation in the functional language of Eqs. 
(10-23) and (10-24). 


With these limitations in mind we return to Eq. (10-52), where we set pai, 
Wick and Cutkosky, observing the analogy with the hydrogen problem in 
momentum space, suggested the use of the stereographic map on a unit sphere in 
five-dimensional space. This method introduced by Fock in the nonrelativistic 
case enables us to use conformal transformations in a transparent manner and to 
exhibit the dynamical symmetry of the system. It is convenient to introduce the 
notation A for the dimensionless quantity 


9192 


~ 16mym27 (Ome) 


This choice reminds us that the g are dimensioned ¢? coupling constants. 
It allows a comparison with the electromagnetic case, 1 being the analog of the 
fine-structure constant. We shall first content ourselves with the equal-mass case 
m, = m2 = m. Taking m as the energy unit, we find 


PP me \? A , Xr(P') 
|(o- =) aS 1 |{(o+:) ap t feat = (et May (10-54) 


where P stands for (Po, 0). This may be regarded as an eigenvalue equation for 
the coupling 2 and appeal to Fredholm’s theory will show that it admits a 
discrete spectrum. The stereographic projection to be considered in more detail 
in Chap. 13 amounts to an association of a point p in R* to a vector z in five- 


dimensional space on a sphere of radius 4,/1 — P*/4 with four-dimensional pro- 


Figure 10-6 Projection from the four-dimensional p 
space onto a sphere of diameter a 1 — P?/4, 
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jection along p and polar angle ¢ such that tan (/2 = |p|/./1 — P?/4 (Fig. 10-6). 
We use (B, 8, p) as the additional polar angles of p; hence p- P = | p| | P| cos B, and 


6-ele F data 


2 2 


P ie 
x (: = oe sin? { cos? 6) (10-55) 


If dQ, denotes the solid angle element on the sphere normalized according 
to J dQ, = 8n?/3 then 
d‘*p’ 1 — P?/4) cos? ¢/2 dQ’ 
peg — FID) 08 t/ 4 (10-56) 
(p—p’) 8 cos® ¢'/2 1— cosa 
where x is the angle between the corresponding vectors z and z’, namely z-z’ = 
[(1 — P*/4),4] cos x. Finally, we define a new function K(z) through 


K(z) (p) (10-57) 


1 
~ cos® (/2 x 
It follows that 


| eg ee 2 A AQ, 
ee eee et (y! 10-5 
(: 7 + q sin € cos* B ) K(z) = ce oe K(z') ( 8) 


The limiting case P? = 0 admits an O(5) invariance with K proportional to a five- 
dimensional spherical harmonic. Let N — 1 stand for the degree of this spherical 
harmonic, which means that multiplication by |z|"~* yields an harmonic function 
in five-dimensional space. To compute the eigenvalues [with degeneracy 
N(N + 1)(2N + 1)/6, N =1,2,...] it is sufficient to apply (10-58) to special 
spherical harmonics depending only on the angle with the fifth axis. Those are 
orthogonal polynomials (generalizing the Legendre polynomials) obtained by 
expanding in powers of z</z, the elementary Green function 


1 1 


E-eP B+ Weslo 


1 00 
eo ae, ns ee _ tN-1 
[1+ t? — 2t cos a)?!” De Cy —1(cos a) 


Thus, choosing z along the fifth axis, Ay is given by 


_ Aw fag, Cnei 6) _ aw [7 a sin® ¢ C1) 
Cyl) = 2% | am, SA 1 —cos¢ 


or, equivalently for |t| < 1, 
gaa! 1 \, dt sin? C 1 1 


2 An Oe eee Gg Sr cos 2 In — 
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From its definition (10-59), Cy—1(1) = N(N + 1)/2; thus 


“NIN +1) (10-60) 


In the general case 0 < P? < 4, Eq. (10-58) exhibits an O(4) invariance very much 
as in the hydrogenic case. This follows from the fact that only the combination 
sin ( cos f, proportional to the projection of z on the zeroth axis (i.e., along P), 
enters the equation, apart from O(5)-invariant expressions. Thus the O(4) 
invariance pertains to rotations leaving the zeroth axis invariant. 

The unit sphere in R®° can then be projected back on the euclidean space Rt 
but now from a point along the zeroth axis. If ) is such that cos ) = sin ¢ cos f, 
let g be the projected four-vector from the unit sphere with |q| = tan (y/2) and 
7(q) = cos® (y/2) K(z). Performing similar transformations as in (10-55) and (10-56) 
we find a manifestly O(4)-invariant equation 


p? | = 
4 2| aia 1 ae | ae eee 10-61 
E 2G (: 5 )+ |x a \z a x(q) ( ) 


The transformation from p to g and x to x can, of course, be performed directly. If m is a unit 
vector along the zeroth axis we have 


a ue 3 q-n 
oe rae g? —2q-n+1 


2 


P 3 
— =) (q? — 2q-n + 1)x(q) 


(10-62) 
and xp) = (: 


The invariance of Eq. (10-61) may be used to perform a partial wave analysis 
in R* using the O(4)-orthonormalized spherical harmonics Ynim(q). The positive 
integer n > 1 indexes the representation of O(4) of dimension n*, analogous to the 
principal quantum number of atomic physics. The ordinary orbital momentum / 
takes values between 0 and n — 1. 


If O(4) is identified with SU(2) x SU(2)/Z, then the ~/,;,, form a basis for the representation (j,j) 
with n = 2; + 1. The O(3) subgroup unmodified by the combined transformation (10-62) is the diagonal 
subgroup of SU(2) x SU(2)/Z2. 


In R* we have a decomposition analogous to (10-59), that is, 
1 Qn? (oo) z jak 1 
Fu Rh = le a a Ym (G) Yam (G -63 
(q—qP qe 3 () mp PCG (10-63) 
Define a radial amplitude R,(q) through 
ne R,(q’) 
x(q) Ree oes ele 2q7(1 = P=) ab 1] 


Then (10-63) and (10-64) when inserted in (10-61) reduce it to a one-dimensional 
equation in the variable x = q’: 


Yrim() (10-64) 


INTEGRAL EQUATIONS AND BOUND-STATE PROBLEMS 491 


A R(x’) 
Ragone: - 
bs) nt [pa |e a (3 “J sities “| Ngee Pe I 


(10-65) 


. equivalent to the differential ot 
—_ (a = ia Ry 
ee + 2x(1 — P?/2)4+1 


supplemented by the conditions that R,, vanishes as x" for x > 0 and R,/x" goes 
to zero as x > 00. 


=0 (10-66) 


Wick and Cutkosky use the variables 


= (10-67) 
wet | 
ranging from —1 to +1 and the function g, such that 
1 
R,(x) = ——,, ga(2) (10-68) 


(i z)" 
to write the equivalent forms of (10-65) and (10-66): 


ssa) UO (i a2zV]__ ale 
wera +(e 973) Ce tee ere ae 


Pe d° . A i _ c 
ja ) + An 1)z = —n(n-1)+4 reas |ae=0 (10-70) 


and 


with g,(+1)=0. 


Standard analysis applied to either (10-66) or (10-70) shows that /, has a 
discrete spectrum for 0 < P? < 4 which may be numerically computed. For fixed 
n(and P*) we have a sequence of solutions indexed by an integer K (K = 0,1,...) 
which is équal to the number of zeros of the radial function (excluding boundary 
points). 

The unequal-mass case may be given a similar treatment, exhibiting again 
an O(4) symmetry and mapping it, in effect, on the equal-mass case. We keep the 
assignment y;,.. = ™,,2/(m; + m2) and define the following quantities: 


m, — M2 
m, + mp 


: Pp? 
=(1—- A?) [ _ ae 7 oF | 
i A? — e”)(1 — A?) (10-71) 


cA 
p= ao = {sin O(1 + q?)n + 2g — 2n[(1 + cos #)q-n + sin 6]} 


tg0=i 


D=1+cos6+(1 —cos 6)q? —2sin6 q-n 
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The unit vector n is along P and these formulas define a complex conformal 
transformation. We deform the contour of integration in such a way that the 
final equation satisfied by 


x(q) = D-*7(P) (10-72) 


is real and reads 


2¢? ‘ / . ee 
4 2 a Fl 1$7(g)= = | dg — . (10-73) 
‘2 + 2q F ee | {nm ae i. q ca 


This is similar to (10-61) to which it reduces for m, = mp since 


2e? i 2 ' Be 1 
(ie ~ 1 —(m, — m2)*m, + m2) (mm, +a)? 

if we recall that in Eq. (10-61) the common mass m was taken as a unit of energy. 

Thus by an appropriate change of variables we can translate the results of 
the equal-mass case into those pertaining to the unequal-mass case. We return, 
therefore, to the former and to Eqs. (10-66) or (10-70). 

Of special interest is the behavior close to the threshold P* = 4 where we 
can approximate the kernel 


1 7 1 eae 
n[x? + 2x(1 — P?/2)+ Tf] x[(x — 1)? + x(4 — P?)] p24 </ e 
Thus, from (10-65), 
R,(x) ~ a = R,(1)[6(x — 1) + x"0(1 — x)] (10-74) 
n 


/4-—P 
Demanding consistency, i.e., that both sides agree for x = 1 [we set formally 
6(0) = 4], requires 


——— 
J4- P= (10-75) 


which is the familiar hydrogenic spectrum in terms of B, the binding energy 
Pa (2 aan) 
iia 


= ue (10-76) 


where m/2 is the reduced mass and / is identified with e?/4z. Unfortunately 
this is not the end of the story. 

The expression (10-74) yields a function without nodes (K = 0) and therefore 
does not give the limit of the eigenvalues corresponding to K > 1. Those cor- 
respond to abnormal solutions for which 2 does not vanish in the limit of zero 
binding and there is therefore no nonrelativistic analog! Wick and Cutkosky 
have shown that the corresponding eigenvalues for a function with K nodes 
tend in the limit P? — 4 to an n-independent limit 


AK 1 1 


mp4 [ind-Pypjpe *K21 Ue 
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This is not the only disease of this model. It is possible to study corrections 
to the sensible set of solutions (K = 0) which reproduces to lowest order the 
nonrelativistic result, Eqs. (10-75) and (10-76). The result of this analysis shows 


that 2ne/A, with e = ./1 — P?/4, has an expansion of the form 


2né 
= 1 + e(ayy In € + ay2) + e?[azi (Ine)? + az2 Ine + a23]+-°*+ (10-78) 


with ¢ of order 1. These logarithmic terms are not present in a more physical 
approximation than the ladder series. 

If this were not enough, Nakanishi has shown that certain solutions have a 
negative norm! They are called “ghosts,” and it is unclear whether they result 
from the inadequacy of the approximation or from a deeper inconsistency of 
the theory. 

The previous study may be generalized to the inhomogeneous equation for 
the amplitude, in particular to high-energy behavior in the crossed channel (cor- 
responding to the exchanged particles). It is possible to find a Regge behavior 
s* with a computable trajectory function a(t). 


Various attempts have been made to introduce an effective potential (or quasipotential) approxima- 
tion to reduce the number of degrees of freedom of the relativistic bound-state problem, in particular 
the troublesome relative time. Even though they lead to interesting practical results, they mutilate 
one way or another the exact theory and generally introduce spurious singularities. 


10-3 HYPERFINE SPLITTING IN POSITRONIUM 


It should not be concluded that relativistic weak binding corrections cannot be 
obtained for two-body systems that agree with experiment. On the contrary, the 
positronium states give an example of a successful agreement. This will serve to 
illustrate the theory. To be completely fair, we should admit that accurate pre- 
dictions require some artistic gifts from the practitioner. As yet no systematic 
method has been devised to obtain the corrections in a completely satisfactory 
way. 

We quote here some of the significant results and refer to Secs. 2-3 and 5-2 
for preliminary investigations. Even though in the study of positronium we restrict 
ourselves to an almost pure electromagnetic system, some of the methods are 
useful in other instances such as models of quark bound states of hadrons. 

The energy difference between the higher triplet (ortho) and lower singlet 
(para) ground states of positronium, denoted respectively 1° Sand 1 So (im aie 
spectroscopic notation n?°*'L,), has now been measured with great accuracy. 
The values quoted for this hyperfine splitting AE,, are 


AE,, = 2.033870 (16) x 10° MHz 
(Mills and Bearman) 


AE,, = 2.033849 (12) x 10° MHz 
(Egan, Frieze, Hughes, and Yam) 


(10-79) 
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The interval is sometimes also called the fine structure of positronium. Recently 
Mills, Berko, and Canter have also measured the spacing between n = 2 triplet 
excited levels 


E(23S,) — E(2?P2) = 8624 + 2.8 MHz (10-80) 


We recall that all these states are unstable. The ground-state radiative width 
has already been discussed to lowest order (Sec. 5-2). 


We may understand the magnitude and sign of the singlet-triplet splitting In positronium by noticing 
that it corresponds to the sum of two effects. The magnetic interaction is given by the Fermi estimate 
discussed in the context of the hydrogen atom (Sec. 2-3-2). In terms of the electron-positron parameters 
with gyromagnetic factors equal to 2, it reads 


AE, = 


a 
ae |po|? <a, * 02) 


where | @o|? is the square of the nonrelativistic wave function at the origin for a system of two equal- 
mass particles |@o|? = (ma)?/8z. 

We have also to deal with a new effect corresponding to the annihilation channel, as was 
mentioned in the early discussion of electron-positron scattering (Sec. 6-1-3). If we restrict ourselves 
to the lowest-order effect it corresponds to the one-photon channel contributing an s-wave interaction 
energy of order « to the triplet state only because of its odd charge conjugation. The desired energy 
shift may be computed from an effective potential identified with the corresponding T-matrix element 
at the threshold (S = I + iT) multiplied by |@o|?. From Chap. 6. the scattering amplitude at the 
threshold is 


3 v(2)y"u(1) u(1)y,v(2) 
(2m)? 


We have been careful about the signs and u(v) denotes the electron (positron) spinor at the threshold 


a= (*) (1) = (x3, 0) 12) = CF) = ( i .) 
0 —id2X3 


The matrix element reads explicitly 


eo ee 
ie yi (—io2)oyx1 + x{o(—io2)(x})" = ant (; xdyoxtya + 5 Ubox2 : ster.) 


where we have used some Fierz reshuffling. Therefore we find 


e? 34+ 6;' 62 
ENE) ae |@o|? Pesce) 


The term <(3 + 0; : 6)/2> is nothing but S? where S = (a, + ¢,)/2 is the total spin; thus it contributes 
a positive amount to the triplet energy only. To lowest order, the total ground-state spin dependence 
to the energy displacement is thus expected to be 


e? 61-0, S 
ING. ts AVS 2 |e 
(Ep + Ea) => |@o| =a 


To obtain the hyperfine splitting we subtract the values corresponding to § = 1 and 0. Thus 


7 
AEts = zo? Ryd (10-81) 
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where Ryd is the Rydberg constant 


mo? 


Ryd = = 3.28984 x 10'S Hz (10-82) 


2 


Eq. (10-81) predicts AE = 2.04 x 10° MHz, which compares favorably with the experimental results up 
to order a. This is fortunately two orders of magnitude greater than the two-photon decay rate of the 
singlet state ',, = «>? Ryd. Hence the computation of the hyperfine splitting will be an accurate test 
both of quantum electrodynamics and of the application of the Bethe-Salpeter equation. 


10-3-1 General Setting 


Our presentation will be modeled after the original work of Karplus and Klein. 
The starting point is the analog of Eq. (10-26) written in the electron-positron 
channel. The bound-state amplitude y = <0|W(x)W(y)|P> stands for a4 x 4 matrix 
in the space of Dirac indices. It should be emphasized that y is a gauge-covariant 
quantity. Consequently, it is not indifferent to choose a particular gauge if x 
satisfies an approximately soluble equation. In practice, since relative velocities 
are small, we expect the nonrelativistic approximation to the binding energy to 
be a reasonable starting point. This is 


ma 


Ey om 
eee 


(10-83) 


and x is given by a Pauli-Schrédinger wave function. 

A noncovariant radiation gauge would seem the most appropriate to derive 
this result. This is, however, a dangerous choice since higher corrections will 
require renormalization. On the other hand, we know that to implement gauge 
covariance in a relativistic manner it is necessary to include in the kernel V an 
infinite series of crossed diagrams. We are facing a dilemma which will be 
resolved in practice in the following way. We will use a covariant gauge, to be 
specific the Feynman one, and will have to separate an instantaneous interaction 
from a retarded one. This is because we are unable to solve exactly the equivalent 
of the Wick-Cutkosky relativistic equation. The retarded interaction will then be 
treated as a perturbation on the same footing as higher-order terms in V. If we are 
led to smaller and smaller corrections which do agree with experiment, this 
procedure may at least be considered as a useful method in spite of its poor 
theoretical foundations. This will be true of the corrections of order «* Ryd. The 
development of the subject shows, however, that even the experts have some 
trouble with higher orders required for a comparison with the very accurate 
experimental resolution. 

Several improvements have been suggested such as expanding y in terms of 
Lorentz invariant scalar amplitudes multiplying covariant quantities, carrying 
out a complete angular analysis on the Wick rotated amplitude or attempting to 
find an equivalent but soluble form of the relativistic ladder approximation. 

Ignoring at first radiative corrections, we write the equation satisfied by y 
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in momentum space as 


2 oe Vix 


cm Be ; 
= 4 10-84 


_ 1 , t 
Val = 19 aie ee tr [y.xP’)] 


where Vs represents one-photon exchange in the crossed rt channel and V, is the 
one-photon annihilation interaction. Even at this early stage the complexity is 
somehow frightening. 

It will be convenient to work with the quantity K obtained by acting with 
the antisymmetric charge conjugation matrix C on the second index of 7: 


Kap = Cap Xap’ (10-85) 


This amplitude has the same transformation properties as one corresponding to 
the particle-particle channel except that the latter has no electromagnetic bound 
state. It satisfies 


(E+5-m) (E-s-m) x = (Ve + V.)K(p) 
2 aid 2 


d*p’ 
VeK =i es | nee yinyoK(p’) (10-86) 


. aaa ; 
VK = -i Aaa YC [ep tr [y*K(p')C] 


We have appended subscripts to the product of y matrices in order to distinguish 
the electron from the positron variables. 

The structure y;,)p/k’, with k = (p — p’), arises from the use of the Feynman 
gauge. We can, however, split this interaction into an instantaneous and a retarded 
part in the total rest frame, P = (E, 0), according to 

viv — vit v2 -yey8 Gee Yi is) 


k2 or k2 k2k2 ke2 

The second term on the right-hand side contains both retardation and magnetic 

interaction. It will be treated perturbatively (we wish we could do better); so will 
be the annihilation contribution. 

The zeroth-order approximation for K is obtained by separating in Vs a 


Coulomb part Vo from a remaining part noted V,, according to (10-87). We write 
therefore in the instantaneous approximation 


=i i 


(10-87) 
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This suggests the derivation of an equation for 


p(p) = | aoe K p) (10-89) 


by dividing both sides of (10-88) by the wave operators and integrating over p°. 
Thus 


y(p) = 


_; “| a,0 (P22 + p+), (P/2 — p +m), 
4n3 (P/2 + p)? —m? + ie (P/2 — p)? — m? + ie 


d>p’ 0..0 t 
Soars oe pp’) (10-90) 


The ie prescription in the denominators follows from the derivation of the equation. 
The p° integral converges and may be computed by closing the contour in the 


upper half plane encountering poles at pp = + E/2 + w — ie where w = \/p? +m’. 
If we use the Dirac notations 8 = y° and a = fy and the definitions 


A(p)=a-p+ Bm 


_ o+ H(p) 


A*(p) ae 


(10-91) 


A=* = Ai (p)AZ (—p) 
the result of the integration over p®° takes the form 


pl al A~~ \ a [ d’p'o(p) 
LT (RSS: aka en Weal jars a 10-92 
aia” (. — E/2 * On =a) on | =p 


Note the equivalence between the two formulations of Feynman’s propagator 


s___btm a A“), A) }: 


 pt—m+ie |po—wmtie potw-ie 


EUS H(p)A*(p) = +oA*(p) 


Eq. (10-92) can also be written as an effective one-body equation 


See d*p’ 
[H.(p) + H2(—p) — EJe(p) = (A~* — A ats | Ee 


Denoting g** = A**@, Eq. (10-93) is equivalent to a set of coupled equations 


(p’) (10-93) 


i as, 
Qw— Ble" =A** | AE fo" (p') +o“ (p)] 


Qn? 


wtp Ao (p+ 0-8 10-94 
(2m + E)p” (p)= A | ele (p')+ oe “(p')] (10-94) 


o*-(p)=e *(p)=0 
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Neither (10-88) nor (10-92) is sufficient for our purposes since it omits the retarded 
part V, and the annihilation kernel crucial for the hyperfine splitting. When both 
of these will be included, we obtain the correct version of an equation originally 
discussed by Breit for such problems. 

To discuss this effect, we will satisfy ourselves here with the form of perturba- 
tion theory originally derived by Salpeter. Jt is designed to circumvent the diffi- 
culty arising from the presence of the energy E, quadratically in the differenual 
operator of the equation (and parametrically in the general kernel) and is suited 
to an instantaneous unperturbed interaction. 

Returning to Eq. (10-88) we multiply both sides by 73 and obtain 


E E - 
E aie His || 5 ~por- H(i |x = — susan (10-95) 
0 4 d°p' r 
Depp) = a Pea oP) 


Assuming @ to be known and using the same ie prescription to invert the 
operator D acting on K we have 


K=D"} ab vO 
2it 
~ (10-96) 
Ate 


= 
a 2 LE/2 + po + (@ — ie)|LE/2 — po ¥ (w — ie)] 


Integration over po would lead back to Eq. (10-92). For shortness let us define 
7 in such a way that 


(H — E)n = vo (10-97) 


where we have set 
H(p) = Hi (p) + H2(—p) 
From Eq. (10-93) it follows that 
p=(AT* —A-~)y (10-98) 
and K is given by 
K =(H —- E) D-*y (10-99) 


For a fixed 7 corresponding to a given solution of the equation we define the 
amplitude Q through an expression similar to (10-99) but with the energy E 
replaced by E’. Recall that D also depends on E so that 


Q=(H— E) x D~ (En (10-100) 
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Consider the combination 


1 1 i 1 
D ioe = Ss 0 ) = Si) == 7 0 =F) —— yan 
as | »|o (HE) = 1 — [a mel) DE 
On the left-hand side when E’ reduces to E and Q to K we find zero. On the 
right-hand side the only dependence on the variable p° is in the propagator 
D~*(E’). The integral is the same as above, leading to 


” 1 , eal 1 y 
Et )— 7 | ao° ro =(H— E’) me v(A'*—A jy 


1 f 
The last equality follows from Eqs. (10-97) and (10-98). Similarly, if 
Ps i 
Q=n"'(H — E') ——D“‘(E) (10-102) 
2in 
the hermiticity of H and v leads to 
4h y I (8) Fain 1 art 
OD(E’) ae |e Ov = Fe n'(E’ — E) (10-103) 


We are now in a position to discuss the effect of a perturbation 
bV = yh y2(V, + Va) (10-104) 
to the instantaneous Coulomb interaction. The new wave function K’ and energy 


E’ will satisfy 


|e) — = | dp® v — ov |x = (10-105) 


We multiply this equation to the left by O, defined as above relative to E’, and 
integrate over the four-momentum p. From Eq. (10-103) we obtain 


Uo | d*p n*(p)p(p) = —2in [ap O(p)(6VK')(p) 
This is an exact result. To make it useful as a first-order result we simply replace 
gy’ by g and K’ and Q by K. Thus 
i 
(2x) 


provided we use the normalization 


3 a > 
(= n*(p)o(p) = (<2 [le**(p)|? — |e" @)|?]=1 (10-107) 


AE = — 


| d*p K(p)(6VK)(p) (10-106) 


For future purpose we shall also need the second-order energy displacement. 
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It involves in principle the inverse of the complete propagator La= I/2in x 
J dp° v]~ +. For the present applications we can, however, satisfy ourselves with 
the free two-particle propagator so that 


— {ep (KV)(p)D- (Es p(OVK)(p) (10-108) 


Investigate the normalization condition from the point of view developed in Sec. 10-2 and obtain 
its relation with this perturbative expansion of the bound-state energy. 


10-3-2 Calculation to Order «> 


Focusing on the hyperfine problem means that we deal with the spherically 
symmetric n = | states. With an accurate enough solution to the unperturbed 
Eqs. (10-92) or (10-93) we may now compute the shift. To lowest order it will 
be made of the two contributions 


1 = Po)? 
AEW) = =m . = | d*p' Rol: fy gee 9s [RD 
(10-109) 
eee eit 4m pir | todty EK (PVC ] tr [Cy Kip) | 


with K standing for Ky?y9. 

With the inclusion of radiative corrections we intend to obtain AE to order 
a> while the leading term is of order «*. It is therefore desirable to find a suitable 
approximation for the instantaneous wave-function @ solution of Eqs. (10-92) or 
(10-93). We recall that the corresponding Schrodinger wave function satisfying 


m a d?p' 
PSP) = a mah In? pane Qs(p’) (10-110) 


that is, with E given by the lowest-order approximation (10-83) and w expanded 
to order p”, is 


7) eee 10-114 
Os\p) = (p? ae m07/4)? Po ( a ) 
where @p is the wave function at the origin in configuration space 
mo 
[go]? = So ait (10-112) 


If this expression [multiplied by the product of two spinors at rest, ie., of the 
form (4) ®@(>) which will not be written explicitly] is inserted into (10- 96), it 
yields an approximation which is sufficient for our purposes: 
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Att 
K Po 
ee 2 [Eo/2 + po + (@ — ie)][Eo/2 — po ¥ (w — ie)] (p? + m?a2/4) 


(10-113) 


Here Eo is given by Eq. (10-83). Therefore to the required accuracy 


ABD _ 20° eel | d*p d*p' 1 
n> (2n)> }(p—p’* + ie (p? + m2a2/4)(p’? + m?a2/4) 


«(5 ‘aaa = a | 
[Eo/2 + po + (w’ — ie)}[Eo/2— po Fw’ — id] | (P—pP 


At a 

LTE pd SGI te) LE = po F (o =) ae 
The bracket denotes the expectation value between the product of spinors at rest. 
We observe in front of (10-114) a factor «°. Therefore the terms that we intend to 
compute can only arise from the contributions of order 1/«? or 1/« to the integral. 
From the presence of the denominators [p? + m?a?/4][p’ + m?«?/4] it is clear 
that the small |p|, |p’| region yields the dominant term and that any factor |p| 
or |p’| will contribute an extra factor «. In the expectation value we find the 
retarded Coulomb interaction and the Breit interaction a, - «2. In the former, with 
a relative error of order «*, we may replace (po — po)” by (p — p’)’, thereby 
suppressing the photon denominator. The po and po integration can then be 
performed and the spin-dependent effect is seen to involve four powers of the 
three-momenta in the numerator. It is therefore negligible to the order «°. 

We come next to the Breit term in «,-a. It is convenient to introduce 
the relative time variable by Fourier transforming the wave functions. We rewrite 
the bracket term: 


~ dpo dpo 2n dt dko ant af 
SSS a, ne, = pe erate ean 1 — ss ie iy D t 
iF —p)+ie \z mot. {D7 *(—1)(—41 + &2)D~*(t)> 
(10-115) 


where k stands for p — p’ and 


£182 Apo eat . il 
DN » . |e 2n : E6/2 + po — e1(@ — ie) [Eo/2 — po — &2(w — i8)] 


Ate 
= Hey es 


+ [0(e2)6(—2) — 0(—e2)0(t)] e*22-F)} (10-116) 


{[0(e1)0(t) — O(—e1)0(—1)] eT "1? Fl? 


In this sum the only Die with a dangerous denominator corresponds to A* * 
since Ey — 2 ~ —(1/m)(p* + m?a?/4) and the small p* region is Ce naue in 
the integration to come. If we were to look for the dominant o* contribution 
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it would be sufficient to keep the term multiplying the projector over positive 
energies. However, to reach the next coun we are forced to use the com- 
plete expression. Contributions of order p* may fortunately be ene om is 
so because they will generate a correction of relative order x? provided p? 
associated to a factor insuring a sufficient ultraviolet convergence. alee 
Eo whenever possible by 2m a satisfactory approximation for D~ ’(t) is 


See im %,°~p (1-2-8) 
= 2o(p? + m*a?/4) (: + om 2m 


x [(@ + m) e-‘t@-™ — (@ — m)e-ilH@+m] (10-117) 


Using the same principle as before and keeping in the matrix element only the 
term responsible for the hyperfine splitting we find 


a °p' Sep), : %°~p eS 
((i+ 2m J 2m ) i 2)(1+ 2m nt 2m ) 
k? 
2 


— ko,+k)+(62 — ko2+k)> 


The remaining part of the integral being rotationally invariant the right-hand 
side may be further replaced by —(k?/4m?)3<o, ‘62>. Inserting those expressions 
into (10-115) and performing the t integral yields 


3 30 
qe ks Ole. >| - d°p d*p' dko 


3 (2n)3 2 4. 47m? /4)?(p’? + x?m?/4)7(ko — k? + ie) 


i i 
6 = eo : 
lea +2m+ie ko toro ==; 


The integral over ko is easily obtained by closing the contour in the upper half 
plane leaving 


4n 3 
AEWD = an)? | Po]? <or “o> | 
ik | (w + m)(@' + m) A (cow’ — m?) id (w — m)(w’ — m) 
4ow'Lwt+o'+|k|-2m  “wto'+ {kl ° o+o' + [kl] + 2m 


d?p d°p’ 
(p> + a7m?/4)7(p'? + x7m?/4)? 


x 


(10-118) 


The last term can be omitted since the integral converges when we set 2” equal to zero in both 
denominators. We use the identity (@ + m)(@’ — m) + (@ — m)(w' + m) = 2(wa' — m?) and the sym- 
metry of the integral to drop in the middle term one of the a? in the denominators, therefore 
obtaining 


| d*p d*p' (@' + m)|k| 
p°(p? + m’07/4)? 4(o + mow'(o + w' + |k}) 


If we rescale p’ — (ma/2)p' and p> mp and keep the dominant « "| contribution this yields a simple 
integral 
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a) [-3) d oe) rtd 3 
el eres per ere ee es ae 
mea 0 /l+p? (1+ /1 + py(lt+p+/1 +p) Jo (1 +P”) ma 


The first integral in (10-118) contains the dominant effect. We write it as 


| d3pd>p' ‘ (w + m)(w' + m) [ky 
(p? + m?a?/4)?(p'? + m?a?2/4)? 4a’ w+’ + |k|—2m 
_f (2x)? a \ aaprad? p' (w + m)(w' + m) |k | i 
2 ma? (p? + m?a?/4)?(p’? + m?a?/4)? 4oo' w+’ +|k|— 2m . 


We cannot apply here the same method as above in a straightforward way. If we were to do so 
we would generate an extra infrared divergence from the energy denominator (w + w’ + |k| — 2m). 
To overcome this difficulty we introduce artificially a photon mass y, thereby replacing |k| by 


vk? + y? in the denominator. If is such that mx/2 «1 «m we can proceed as before. The remain- 


ing integral is then easily evaluated as 
(2n)3 m 
ae | ln 
mo mM 


Collecting our results we have 


5 
AE” = 55 |90|? <a: +62 -2( +5n zm) (10-119) 
The «* contribution agrees, of course, with the original estimate. The next correc- 
tion contains a spurious infrared divergence due to our way of estimating integrals 
(it is, of course, correct with » chosen as explained above). When all terms of the 
same magnitude will be collected this In (m/y) should (and will) disappear. 
Instead of proceeding directly to the evaluation of the annihilation contri- 
bution we shall turn to the second-order effect to exhibit the announced can- 
cellation. It would, however, be foolish to compute the second-order effect of the 
V, potential without introducing the correction to the Bethe-Salpeter kernel due 
to the crossed exchange of two photons (Fig. 10-7b) to maintain gauge covariance. 
We denote the sum of these contributions by AE}}),,. In applying Eq. (10-108) 
we are going to make a further simplification. We observe that in this relation 
we have replaced the intermediate propagation of the electron-positron pair by 
the free propagation. This amounts in Fig. 10-7a to neglect the instantaneous 
interaction between the two V, interactions. In the same spirit we may substitute 


(2) (b) 


Figure 10-7 Second-order exchange: (a) the retarded interaction V, to second order and (b) crossed 
photon exchange. Broken lines represent the instantaneous exchange, notched lines the retarded one, 
and wavy lines the covariant photon propagator. 
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for K (and K) its zeroth-order approximation, i.e., the wave function at the origin 
in configuration space 


K(p) > | d*p K(p) = (2n)°o 


In short, 


6 Aion? . d*k ye yoks ) 
AEG ea (Qn)? ry =|9o| lw seat te k2 Yr ye 


? a et — Fem (a — ) 
[(ko + m?) — w? + ie] [(ko — m)* — w* — ie] “~“ 


yi(P/2 + K+ mr yivy2(P/2 + k + mzyan\ 
[(ko + m)? — w? + ie]? 


ob 


(10-120) 


The second term is, of course, the crossed photon exchange. The notation w is 


for a k* + m? and P has been taken here equal to (2m, 0). We proceed to the 
matrix algebra by taking into account the fact that a spherical average over k 
may be performed. Thus the part contributing to the splitting is 


k2 k2 
(i 2 — 1° i) (f ++ m) (f —+ m) Ce 2 — yy" i2)) 


> 361° 02> ko ~ 
av 


(i (5 +K+ m) Vivy2 (£ +K+ m) 12x) 


ee 2 Co; : o> (3k3 = 2k?) 
av 
and we are left with 


Sia? dk 
AES ne <o -e:) (" ak | — 
b+ lol? <o1 + 2 ‘oa ae 


c } ey ri — 2k? 
[(ko + m)* — w? + ie] [(ko — m)* — w? + ie] © [(ko — m)? — w? + ie]? 


We reinstate a photon mass p and, using similar techniques as above, we find 


AEs, a = —> B\,,\? Mons «>(3 + In =) (10-121) 
2u 

The origin of the infrared divergence lies here in the use of a free two-particle 

propagator, but as expected this approximation is justified since the In (m/y) terms 
cancel between (10-119) and (10-121). 

Second-order radiative corrections AE} modify the vertices and the V, poten- 

tial through vacuum polamizaiion. The latter correction does not affect the singlet- 

triplet splitting to order «° since it is a short-range effect while the former may 
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be taken into account by including the anomalous magnetic moments of the 
electron and positron, ie., by multiplying the dominant term in (10-119) by 
(1 + «/2z)*. Thus to the required order 


2a? 
3m? 


We turn now to the annihilation part AE{) in Eq. (10-109), where the replace- 
ment of P* by 4m? in the denominator was justified for the present calculation. 
A new difficulty arises here since in this contribution we implicitly encounter 
part of the vertex charge renormalization by including the Coulomb wave function. 
This is clearly indicated in Fig. 10-8. Care must be paid to the way in which the 
subtraction is carried out, because of the noncovariant splitting of the one-photon 
exchange potential. To restore the covariance of the procedure it is necessary 
to include the second-order terms V,D~!'V, + V,D~1VY,. This has the effect of 
completing the photon propagator to its covariant form. As a matter of fact, it 
follows from the approximation of a free two-particle propagator in the second- 
order energy displacement that the V, potential acts immediately before or after 
the annihilation vertex. Of course, the leading «* contribution is insensitive to 
this effect. Thus we shall directly combine AE’ + AE). 5a into 


AER = 


| Po|? <o1 + 62> (10-122) 


a 


(1) (2) 
a + AEg+oa = - ——TT 
a eos 4n(22)*m? 


2 [ate tr ([Rlp}) + ARI.) 
x [ep tr {Cy“[K(p) + AK(p)]} 


d*p tr [Cy* K(p)] = 4num? a tr Cy" Il aie 1° ") 
. co(p? + m?o?/2)° 2m 


2 
%2°p P 
‘ (: 2m )+ =, loo} 


tr [Cy"D~ *(p, P)(po/p? — %1 * #2) Po] 


ve 


[ep tr [Cy"AK(p)] = 2i« er 


pays tie 
tr [cy'D*, O)y1’y2vPo] (10-123) 
p — A? + ié 
| if ! J | 
i" Qe a” ! 
I 
bs (o) 


Figure 10-8 Annihilation diagrams: (a) lowest-order term and (b) second-order contribution of the 
crossed term V,D~1V, + V,D~*Va. 
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Note that this time in K(p) we have kept the complete two-particle free propagator, 
while @o is still to be understood to involve the product ¥; © 72 of two-component 
spinors describing the polarization of the state and hence appears as a matrix 
under the trace sign. In AK(p) we have approximated the Coulomb wave function 
by its value at the origin in momentum space and used an infrared cutoff pu. 
The vertex renormalization is obtained as in Chap. 7 by subtracting the similar 
expression at a large photon mass A and using the renormalization constant 
Z, = Z, in the Feynman gauge computed by the same method as in Eq. (7-34): 
Z=1-2(inA+2mH+2) (10-124) 
2n m m 4 
It was already noted in the previous subsection that only space-like values of 
virtual annihilation photon polarization (index jy) contribute to the splitting. 
Moreover, the charge conjugation matrix 


P 0 —id> 


is odd in the representation we are using, meaning that 


tr[Cy()i()2 Go] 
is to be interpreted as 


(0, 12 (—io2))[C~*( )aC yl 1] (**) 


Using this remark, a straightforward (but long and painful) calculation yields 


TO An 
AB A = m (: - *) tr (po yC)° tr (C yo) 


4 + : . By) 
alee (: = Eckel ~iosve8] “Lt iosdar 00 


a (: = *) oo <S*»> (10-125) 


The above expression has still to be multiplied by [1 — @(4m7*)] to include the 
radiative correction due to the vacuum polarization to order x. In this formalism 
this effect arises from the second-order V,D~! V, term 


: 8 
(4m?) = = (10-126) 


leading to an extra contribution 
TOL 


8a 
AE Ww = 3 (- 5] | @o|? <S?> (10-127) 


1 


INTEGRAL EQUATIONS AND BOUND-STATE PROBLEMS 507 


P/2 P/2 P/2 : P/2 
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Figure 10-9 Two-photon annihilation diagrams. 


The last piece of the puzzle is the two-photon annihilation term with (minus) 
its Imaginary part equal to half the singlet level width (Fig. 10-9). Its expression 
is found by adding a new term in the Bethe-Salpeter kernel. It is best to return 
to Eq. (10-84) and to approximate the wave function by its nonrelativistic value 
at the origin. thereby obtaining the leading x° effect in the even-charge conjugation 
channel. To keep track of various coefficients it is simpler to use the relativistic 
notation with u(1), v(2) being spinors at rest: 


2) __ ft 2, _ Ian d*k ia. 1 1 


ae 7 . 10-128 
x* u( ) @ P2 Si k—m yy yy ET ee ( ) 


Evaluation of the integral and of the spin matrix element leaves us with a coefficient 
proportional to the projector on the singlet state: 


AE? — wy =- - |po|2<2—S*)(2—2In2+in) (10-129) 


with the imaginary part giving 2, = mx°/2 as found in (5-128). 

We have now the complete value of the hyperfine splitting in positronium 
to order 7°. Taking Eqs. (10-119), (10-121), and (10-122) for the exchange channel 
and (10-125), (10-127), and (10-129) for the annihilation one, and subtracting 
the values in the S = 1 and S = 0 states, we find the Karplus and Klein result 


1m ed) 
AE. = : a* Ryd E — (F +2In 2) | (10-130) 


This value is not yet sufficient for a comparison with the experimental result. The reader will 
appreciate the effort needed to extract the x° terms. The difficulty lies partly in a good control 
of the recoil effects, 1.e.. of a correct treatment of the relativistic Coulomb wave functions, which 
leads in fact to terms of order x° In (1/x). Other effects such as vacuum polarization in the exchange 
channel also contribute to this term, so that the correction is of the type (7/2) Ryd [Ba* In (1/a) +C). 
The latest value, reported by Lepage in 1977, is B= —%. When added to (10-130) this would yield 
the theoretical prediction 2.033774 x 10° MHz, to be compared with (10-79). 

The Bethe-Salpeter formalism may also be applied to obtain radiative corrections to the decay 
widths, as well as to other systems such as the hydrogen atom. As an exercise the reader may 
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derive, within the present formalism, the excited spectrum of positronium to order o”: 


‘ 1 ilo? a? . 
Ey.ts = 2m + Ryd Sita erat eae LT 


~ Qn?” 32n* on 
| 
0 
= —6 — 
Eis “Sona | 
SLAY I eg (10-131) 
(L + 1)Q2L + 3) 
; 1 7 1 — 2510 \ 
= —-—— + 619 + ——- x J-————_\ Jel 
ie eee PEA L(L+1) 
_ GBL-) [Sa 
LOL—v 


How does this compare to the triplet splitting of Eq. (10-80)? 
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earlier work. 
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CHAPTER 


ELEVEN 
SYMMETRIES 


The importance of symmetries in quantum systems and particle physics has 
led to a number of specific investigations. Model building and understanding of 
the fundamental interactions require the identification of the various invariances 
and the way in which they happen to be realized, approximated, or even violated. 
This is supported by the spectacular results of unitary symmetry, current algebra, 
and chiral invariance, and by the developing interest in the quark substructure 
of hadrons. On the other hand, similar considerations are at the heart of the 
theory of order-disorder transitions in macroscopic media, emphasizing once again 
the deep relationship between field theory and statistical mechanics. We present 
here some of the models, the study of which has brought to light specific field 
theoretic phenomena such as almost degenerate multiplets, spontaneous symmetry 
breaking, and quantum anomalies. 


11-1 QUANTUM IMPLEMENTATION OF SYMMETRIES 


We have already encountered several instances of symmetries. Some were discrete 
such as parity, time reversal, and charge conjugation. Others were continuous, 
as is the kinematical Lorentz invariance, or even space-time dependent, as in the 
case of gauge invariance. We shall now concentrate our attention on continuous 
internal symmetries and discuss the realization of quantum symmetries within 
lagrangian field theory, even though some of the statements and results do not 
require such a specific framework. 
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11-1-1 Statement of the Problem 


Consider a theory invariant under a continuous group of transformations at the 
classical level. More precisely this means that the lagrangian (or hamiltonian) 
and hence the equations of motion are invariant under these transformations, as 
discussed in Chap. 1. The question is now to clarify the way in which this property 
is reflected in the quantum system, and in particular to study its consequences 
on the spectrum of states. 

In the quantum framework the discussion of symmetries implies the existence 
of a group of transformations (local automorphisms) acting on the physical 
observables and therefore on the dynamical field variables. In infinitesimal form 
we are thus led to the construction of associated four-vector current densities and 
integrated charges. The point is then to know whether we can implement these 
transformations by unitary transformations on the Hilbert space of states. Locally 
we want to study the current conservation, while globally we ask whether the 
charges generate unitary operators commuting with the S matrix. 

In Chap. 1 we have already described within the classical theory how we 
can associate to a family of infinitesimal transformations a corresponding set of 
Noether currents. Given the field transformations 


P(x) > P(x) + dG(x) 


(11-1) 
G(x) = Fa(x)da" 


depending on the infinitesimal parameters 6x’, the currents are defined through 


aL 5o(x) 


a(64h)  da2 i) 


Jilx) = 
where ¥ is the lagrangian. This holds whether or not the action is invariant 
under these transformations, provided the lagrangian is a function of the fields 
and their first derivatives only. Using the Euler-Lagrange equations of motion the 
variation of Y reduces to 


OL = 0" fou" (11-3) 
The charges 


Q*(t) = [ex Jo(x, t) (11-4) 


are then the generators of these transformations through the Poisson bracket 
operation 

op 
ba? 
Equation (11-5) assumes that the Poisson bracket of @ with 6@ vanishes. This is 


the case if 6g does not depend on the conjugate momentum to @. In particular, 
the quantities Q(t) satisfy the commutation relations dictated by the Lie algebra 


{Q°(t), o(x, )} = <2 (x, 1) (11-5) 
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of the transformation group with structure constants C”. [see Eq. (1-146)]: 


{Q%(t), Q?()} = —C”.O%(t) (11-6) 
According to (11-3) if # is invariant the currents are classically conserved : 
o*jn = 0 (11-7) 


and the charges are time independent. This can be seen using Stokes theorem 
and neglecting spatial surface terms. 

In the quantum theory we would like to reproduce the previous steps and 
in particular construct the conserved currents. However, since the latter are in 
general local polynomials in the fields it is not unlikely to encounter some 
difficulties. A preliminary regularization may sometimes be useful. It might, for 
instance, involve a slight relative shift of the arguments of the fields, the subtraction 
of divergent terms, ete. As a result. the validity of the conservation law may no 
longer appear trivial. In practice we do not always master the commutation rules 
of fields, and we may rather try to check whether it is consistent to assume 
that the conservation laws hold. order by order in perturbation theory, after an 
appropriate renormalization. This program is carried out by showing the equi- 
valence with a set of identities to be fulfilled by the Green functions. These are 
similar to the Ward identities of electrodynamics and can be tested perturbatively. 
In the example of the chiral symmetry of the o model of Gell-Mann and Lévy 
we shall discover that anomalies due to ultraviolet divergences may invalidate 
the classical conservation laws. This kind of study is therefore not academic. 

It is also necessary to consider the global aspect of the problem, namely, 
the unitary implementation of symmetries and its effect on the spectrum. Two 
cases arise. The first is the most familiar one and has been studied in detail since 
the early works of Weyl and Wigner. It corresponds to a unitary representation 
y — Utg) of the symmetry group in the Hilbert space of states. Its action on the 
observables is given by 


A—%A = U(g)AU‘(g) (11-8) 


Such was the case of the Poincaré invariance. This involves the discussion of phase 
factors leading to representations of the simply connected covering group (Wigner’s 
theorem). The states are classified in multiplets corresponding to irreducible 
representations. In particular, the vacuum or ground state corresponds to the 
identity representation. Symmetries sometimes rely on approximations, such as 
neglecting weaker interactions. This is the case of the isotopic invariance which 
is often said to be weakly broken by electromagnetic forces. We shall briefly 
review this situation below. It may, however, occur that the symmetry is only 
reflected at the dynamical level and that the vacuum is not invariant. We call it 
the Goldstone realization. The unitary operators U(g) do not exist, and the 
symmetry appears spontaneously broken. States are no longer classified in multi- 
plets and under certain conditions massless particles appear. This situation is not 
infrequent in physics, an example being ferromagnetism and spin waves. In particle 
physics the dynamical behavior of pions seems to approximate this scheme. 
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11-1-2 Behavior of the Ground State 


The symmetry propertics of a system may be characterized by the behavior of its. 
ground state. As we shall see, if the vacuum is annihilated by a set of charges, the 
corresponding currents are conserved and the symmetry group unitarily imple- 
mented. Conversely, assuming conserved currents, if the vacuum is not invariant 
the symmetry is spontaneously broken. 

Were the vacuum not unique, we could define an orthonormal basis in this 
lowest-energy subspace, diagonalizing all commuting hermitian observables. To 
simplify, assume a discrete set of such states |n >. Consider any couple of observables 
A(x), B(y). When their arguments are infinitely space-like separated, only the 
intermediate translationally invariant ground states should contribute to the 
matrix elements of their product, according to a generalization of the Riemann- 
Lebesgue lemma 

lim <n| A(x)B(O)|m> = ¥. <n| A(O)|p><p| BOO) |m> 
|x| 00 P 

Here |p» runs over the ground states. According to causality, the matrix element 
of the commutator <n| [ A(x), B(0)] |m> vanishes in the limit |x | - x. The matrices 
<n| A(O)|m> and <n| B(O)|m> commute and can be simultaneously diagonalized. 
Within a given sector the vacuum matrix element of a product of local operators 
factorizes when their space-like separations become large. This is called the cluster 
property. x 

An apparent degeneracy of the vacuum can occur as a result of a poor 
approximation. The real ground state is the unique combination which minimizes 
the energy. An example is provided by a quantum mechanical one-dimensional 
system with a potential V(x) = (x? — 1)? with two equal minima. Quantum 
mechanics allows for barrier tunneling and leads to a unique symmetric ground 
State. 

We may approach the ground state in a translationally invariant way by 
requiring it to minimize the effective potential V.4(@). The latter plays the role 
of density of potential energy in a state with a given mean value of the field. 


Let us quote an important property which follows from locality. Assume that a conserved current 
operator j"(x) has been constructed, so that ¢,,j"(x) = 0. Define the integral of j°(x, t) over a bounded 
region V of space, Qy(t), as 


Ov(t) = i deaie (xt) (11-9) 


This operator clearly stands a better chance of being well defined than its limit Q(t), the integral 
over all space. The commutator of Qy(t) with a local observable A is independent of time for V 
large enough: 


eed 
lim = [Ov(t), 4] = 0 (11-10) 


V0 


Indeed, from current conservation we have 


d 
0= | d>x [0,j"(x, 0), A] =— | Px 0), alee | dS « [j(x, 0), A] 
Vv dt Jv 5 
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When V becomes large enough, the surface integral vanishes since the commutator involves local 
operators separated by a very large space-like interval. 

The validity of this statement extends to more general situations. In particular, A may be 
replaced by any multilocal operator. The proof could even be extended to a nonrelativistic system 
provided that only short-range forces are present. 


What seems an utter triviality -either the vacuum is invariant or it is not— 
covers, however, a wealth of situations with very different physical content. What 
ifthe vacuum is indeed invariant? A theorem of Coleman shows that the associated 
currents are then conserved. Assume that Q(t), the space integral of j°, is well defined 
(at least on a dense subset of the Hilbert space including the ground state) and 
annihilates the vacuum 


Q(t)|0> = | ax ite t)|0> =0 (11-11) 


If the theory admits an energy gap. any state with zero momentum will be such 
that 


<n, P, = OJ jo(x, t)|0> = 0 


from translational invariance. since the above quantity is x independent with a 
vanishing integral. Here we cheat slightly since strictly speaking such a state is not 
normalizable. The necessary refinement may, however, be worked out. Conse- 
quently, 


<n, P, = 0| 8oj°(x, t)|0> = <n, P, = 0} 0,/4(x, 1)|0> = 0 (11-12) 


The last equality follows from the fact that V- j(x, t) = i[P, j(x, t)]. Assuming 
the matrix element of j(x.t) to be meaningful the matrix element of its three- 
divergence must vanish between zero momentum states. Finally ¢,j"(x) being a 
Lorentz scalar, Eq. (11-12) generalizes to arbitrary matrix elements between the 
vacuum and any other state. Hence 0, j“ annihilates the vacuum. One may show 
that this is only possible for a local operator if it vanishes. 

We conclude that 


é,j* =0 (11-13) 


The symmetry is exact and unitarily implemented by operators of the form 
U = ei? 


11-2 MASS SPECTRUM, MULTIPLETS, AND 
GOLDSTONE BOSONS 


11-2-1 The Octet Model of Gell-Mann and Ne’eman 


When the internal symmetries form a compact Lie group, we may choose the 
phases of the representative operators to obtain a unitary representation of its 
covering group. This representation splits into irreducible components, acting on 
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subspaces of the original Hilbert space. The latter is therefore generated by 
multiplets of states. 

Here we shall briefly describe the SU(3) symmetry or octet model valid with 
some degree of approximation at the level of strong interactions. It generalizes 
the familiar isospin invariance of nuclear forces introduced by Heisenberg in the 
1930s. 

Larger flavor groups, as they are called nowadays, manifest themselves at 
higher energies. There is already some evidence for an SU(4) symmetry through 
the study of the new multi-GeV narrow resonances, and it may well be that this 
is not the end of the story. 

The members of the elementary isodoublet, proton and neutron, are classified 
according to the eigenvalues of the third component of isospin 73 with eigenvalues 
+ in such a way that 


Q= +5 (11-14) 


Here N is the baryonic charge. In the lagrangian framework, the corresponding 
fields will form an isospinor yw and, neglecting mass differences, the kinetic term 


Lo = Wid + My 
is invariant under isospin rotations 
“>Up yout (11-15) 


with U a 2 x 2 unimodular unitary matrix. In the simplest case we couple this 
field to the pion isovector in an invariant way (see Chap. 5). For the pion, a 
relation analogous to (11-14) holds with N = 0 and 73 taking the values +1, 0, 
or — 1. Using the pseudoscalar character of x the only (renormalizable) invariant 
interaction is 


Pint = igannWy°t ‘ Tay (1 1-16) 


The boldface characters used for the Pauli matrices or the pion field refer to 
isospace. Equation (11-16) expresses in a compact form a set of relations on the 
various couplings 


Gxt pn = / 2Gshpp = —/2Gn'nn (11-17) 
asserting the dynamical content of the symmetry. These relations were obtained 
by noting that the usual fields 2), 2°~’ = (x), and x are related to the 


cartesian coordinates through 
a(t + 03 + 103) = nal) + $(O" 
n°) = 15 (11-18) 


1 
= We (11 + in) 
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ee 1 

7 2 
ee Isopin 5 (nucleon) 

n Pp 
———————Y—— Isopin 1 (pion) 

an ax 
a i ee Isopin 3 (A resonance) Figure 11-1 Weight diagrams for 
= AY RE Re SU(2). 


Note that 2‘~) creates a positive pion and annihilates a negative one. When 
discussing pion-nucleon scattering we mentioned some dynamical consequences 
of this symmetry such as triangular inequalities on cross sections. In the early 1960s, 
Gell-Mann and Ne’eman succeeded in obtaining a generalization of isospin 
symmetry to a larger group SU(3). More precisely, all known multiplets correspond 
to representations of the factor group SU(3)/Z3 where Z3 is the abelian center 
of SU(3) generated by the cubic roots of the identity. The latter therefore acts as 
the identity on all hadronic states which are called of triality zero. 

We recall that the representations are conveniently described in terms of the 
Lie algebra of the infinitesimal hermitian generators of the group. The procedure 
is familiar in quantum mechanics. We diagonalize a maximal set of commuting 
generators (Cartan subalgebra). Basis states are represented by weight vectors 
with components equal to the eigenvalues of these operators in a space of 
dimension equal to the rank of the Lie algebra, i-e., the dimension of its Cartan 
subalgebra. For the familiar example of SU(2), of rank one, weight diagrams 
are one-dimensional, the abscissa being the eigenvalue of TJ; (Fig. 11-1). A 
particular role is played by the adjoint representation acting on the Lie algebra 
itself through the linear action of commutation. In the case of SU(2) it corresponds 
to isospin 1 with 


lI 


Ts 
. [T, ta] 


T, + iT, 
amt in 


(11-19) 


This extends to SU(3) with eight (3 x 3 traceless hermitian) generators and a 
Lie algebra of rank two. The two diagonal generators are linear combinations 
of iospin 73; and hypercharge Y, equal to the sum of baryonic number and 
strangeness. The Gell-Mann and Nishijima formula gives the relation with the 
charge generalizing Eq. (11-14): 


The lowest-dimensional octets of baryons and pseudoscalar mesons (correspond- 
ing to the adjoint representation of SU(3)] as well as the resonance decuplet 
which led to the discovery of the Q~ particle are shown in Fig. 11-2. 
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Figure 11-2 Meson and baryon octets and resonance decuplet in the Gell-Mann and Ne’eman model. 


A basis of the Lie algebra generalizing the Pauli matrices is given by the 
Gell-Mann matrices 


01 0 Ue eee 1 00 
mt 5 me 0 7 a-(0 | 7 
000 0 00 0 “ee 
1 Oma 0 
| (0 0 7 Ag = , (112% 
1 0 io 0 0 


00 O 1// soe 0 
A,=|0 0 -i}] As=| 0 1/3 O 


0 i 0 0 0 —2/,/3 


5 
£ 
\ 
oes 
SS 2 cS (ame ee) 


as 
_- CO © 


These matrices are normalized according to 
tr (A;,A,) = 20% 
and fulfil the commutation and anticommutation relations 


[Aes Ai] a Dict An {Aus Ai} = 4641 a BY; Fey Ven (11-22) 
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with fim totally antisymmetric and dyim totally symmetric, with nonvanishing 
elements given by the following table: 


— 


k Im Zfkim klm 2dkim kim 2ditm | 

|e 2 mee 73 | 316.6 -1 

Les 1 1 4 6 1 spy —1 

rs 6 =I 15977 1 448 —1/,/3 

246 1 22S 33 558 1/3 (11-23) 
257 1 247 —1 668 —1/,/3 

s45 Bi ee 1 fas = 1/./3 

367 ~1 338 2/3 888 -—2/,/3 

458 ye 344 1 

678 V3 | 5536 a 


From the time of its introduction the unitary symmetry has remained puzzling 
by the lack of multiplets with nonzero triality. The simplest of those would 
correspond to a triplet of hypothetical quarks, introduced by Gell-Mann and by 
Zweig. These particles [an isodoublet (u,d) and an isosinglet (s)] would have 
fractional charges and baryonic number (Fig. 11-3). If the unitary group turns out 
to be larger than SU(3), more quarks carrying new quantum numbers would be 
required such as the charmed quark (c), an isosinglet of charge 2/3 and hypercharge 
1/3. The corresponding new symmetry group SU(4) is of rank three. The weight 
diagrams for its (four-dimensional) fundamental representation and (fifteen- 
dimensional) adjoint representation are shown in Fig. 11-4. 

Many-unsuccessful attempts have been made to detect free quarks, mostly on 
the basis of their fractional charge. It appears as if all known hadrons share the 
properties of quark-bound states. The idea that the latter might be permanently 
bound has led to the term “confinement” and several mechanisms have been 


Figure 11-3 The quark triplet. 


518 QUANTUM FIELD THEORY 


Figure 11-4 Fundamental and adjoint representations of SU(4). We have displayed the new charmed 
mesons. 


suggested. One of the most promising assumes an unbroken color symmetry 
according to a local invariance, to be discussed in the next chapters. 

The octet symmetry is only approximately obeyed by strong interactions. 
This is even more true of any larger invariance group. Nevertheless, the pattern 
of symmetry breaking looks very regular, obeying the so-called octet dominance 
rule. To quote but one example, consider the mass differences between members 
of the same multiplet, for definiteness the baryon octet of nucleons, A. £, and = 
particles. In an effective lagrangian, assuming only isospin invariance, the mass 
terms would have as coefficients my, m,, My, Mg. Each bilinear field product such 
as pp +nn+--: transforms under $U(3) as the direct product of two octet 
representations which can be split into irreducible parts according to the Clebsch- 
Gordan series 


8 @8 = 1'@8' 8" G10 @ 10 @27 (11-24) 


Neutrality means that the non-self-conjugate representations 10 and 10 are 
absent in the mass matrix. This implies that the four physical masses may be 
reexpressed as linear combinations of four parameters multiplying the combina- 
tions having definite transformation properties according to 


my = (m, — 2mg + mg — 3m27) 


my = (my — mg — mg + 927) 
: (11-25) 
My = (m, + mg + ms + m7) 


Mz = (m, + mg — 2m — 3m27) 


This reshuffling is, of course, of no interest unless some of the irreducible param- 
eters vanish. Suppose, however, that the mass splitting is generated in a more 
fundamental lagrangian by a quark-mass term of the type W(a + bAg)wW carrying 
only singlet and octet representations and that this property is (miraculously) 
preserved by the interactions. This would mean that m3, vanishes in Eq. (11-25), 
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leading to the Gell-Mann and Okubo relation 


2(my + mz) = 4(3m, + ms) (11-26) 


experimentally well satisfied with 
3(my + ms) = 1.128 GeV 4(3m, + ms) = 1.134 GeV 


For the pseudoscalar octet (A, 2,1) the analogous formula involves empirically 
square masses 


A4mx = 3m? + m2 (11-27) 


We should allow for some mixing with a ninth 7’ meson to complete a nonet. 
Using Eq. (11-27) at face value, it predicts m; = 0.320 GeV’, the experimental 
value being m? = 0.301 GeV?. 

Octet dominance applied to the resonance decuplet leads to equal-mass 
spacing, in good agreement with the observed data and considered as a major 
triumph in its prediction of the complete set of quantum numbers of the Qu 


particle: 
Mss — my = 152 MeV 


m= Mo + mg Y Mz — Myx = 149 MeV (11-28) 
Mg — Mz» = 139 MeV 
Unitary symmetry leads to numerous relations among scattering amplitudes 


which are in satisfactory agreement with experiment. We shall have more to say 
about it when we discuss the applications of current algebra. 


11-2-2 Spontaneous Symmetry Breaking 


Spontaneous symmetry breaking occurs when the ground state is not invariant 
under the transformation group. The latter acts in a larger nonseparable Hilbert 
space, as was exemplified by the behavior under rotations of the infinite ferro- 
magnet (Chap. 4). We can convince ourselves of this fact by the following heuristic 
remark. Let us try to compute the norm of the state Q|0>, where Q is the would- 
be total charge 


<0| Q?|0> = fex <0] jo(x)Q |0> (11-29) 
From translational invariance this is also equal to 
[ex <0| jo(0)Q |0> 
and is clearly infinite when Q|0> # 0. 
The most striking consequence analyzed by Goldstone is the appearance of 


massless particles when the broken symmetry is a continuous one. These states 
are generated by operators which would rotate the vacuum by an infinitesimal 
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amount to a degenerate vacuum, since on physical grounds such a transformation 
does not cost any energy. 

To prove Goldstone’s theorem we assume the existence of a conserved current 
and consider any operator A such that 


da(t) = lim <O|[Qr(t), A] |0> 4 0 (11-30) 


That there exists such an observable just expresses the noninvariance of the 
vacuum. Insert a complete set of intermediate states of definite four-momentum 
in this relation: 


da(t) = lim | d?x [<0| jo(0)|n><n| AlO>e *-"* — CO] A|ny<nl jo(0) OY e'”*] 
V>co n JV 


JolO) |O» eff] 


= ¥ (2n)°57(P,) [<0 jo(0) |n><n| A |0> e Et — €O| Ajn><n 


40 (11-31) 


We have already shown in Eq. (11-10) that current conservation implies 
d 
— da(t)=0 11-32 
5; Salt (11-32) 


Therefore 
Os Sy (27) 5°(P,) En[ <O| jo(0) |n><n| A |0> e- were Se <0| A |n><n| jo(0) |O> e!*,"] 
' (11-33) 


It then follows from Eqs. (11-31) and (11-33) that a state |n> must exist such 
that <O| A |n><n| jo(0)|0> # 0 for which E,,6°(P,,) vanishes. This is a massless state 
with the same quantum numbers as jo (and A) since it is generated by this operator 
from the vacuum. 

These massless states are called Goldstone bosons associated to the symmetry 
breaking. They are indeed bosons if jo is a boson-type operator. In a more general 
setting such as the recent supersymmetric theories j, could in fact carry half-integer 
spin and the associated massless particle would then be a fermion. In other words, 
the spin of these states bears a relation with the Lorentz transformation properties 
of j,(x). 

A subtle point is that the massless states need not necessarily be observable. 
This remark applies to theories with an unphysical sector of unobservable states 
(such as quantum electrodynamics in the Gupta-Bleuler gauge) and may be relevant 
when we want to avoid the conclusion of the theorem. 

The mechanism of spontaneous symmetry breaking is characterized by its 
aesthetic appeal, as opposed to the ad hoc breaking prescriptions such as those 
encountered when discussing the octet model. It is economical in the sense that 
it does not introduce any new parameter. It is also theoretically advantageous 
since it preserves renormalizability properties. As a counterpart the appearance 
of massless particles might be an unpleasant feature in certain applications. We 
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shall encounter in the next chapter an elegant way to dispose of them using 
Higgs’ mechanism. 


As an extreme example of spontaneous symmetry breaking consider a free massless scalar field with a 
lagrangian 


L = 4(0oy (11-34) 
invariant under the field translations 
(x) > P(x) + A (11-35) 
The associated conserved current is 
Hx) = 0"(x) (11-36) 


The vacuum is, of course, not invariant under these transformations. If we substitute ¢(x) for A in 
Eq. (11-30) we find 


5<0| o(x)|0> = 4<0|0> # 0 (11-37) 
Alternatively, the total “charge” 
lim Qy(t) = lim | d>x Aod(x, t) (11-38) 
V0 V0 JV 2 


is not well defined. The Goldstone boson in this case is the quantum of the field ¢ itself. An interesting 
point about this example is that we may look at the state obtained from the vacuum through the 
action of e'*2r. From the Fourier decomposition of the field 


ak : ; 

(x) = ies [a(k) gate “4 at(k) en] (11-39) 

it follows that in the infinite-volume limit the state 
|A> = lim e424 |Q> (11-40) 

V>ox 

takes the form 
3k : 
A> = lim exp 4d | d3x oie a(k) e*** — at(k) e—*°*] > |0> 
v- v 2(2n)° 

bs (11-41) 


= g(A/2)[a(0)—a'(0)] \0> 
It appears therefore as a coherent superposition of zero-energy momentum states. To show that 
<i|0> vanishes we have to be more careful in the handling of the infinite-volume limit. It is better 
to introduce a smooth spatial cutoff, for instance, 


Ov = | d°x dop(x, 0)e-*/V" (11-42) 
The reader can easily verify that 
{Ol A> = ea PAV /64 _,0 (11-43) 
Vow 


A physically meaningful model exhibiting spontaneous symmetry breaking is 
based on a set of n coupled scalar fields with a lagrangian 


2 


216) = 309 — 9? — 167 (11-44) 
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(a) (b) 
Figure 11-5 The potential V() = (u2/2)? + (A/4)(6?)?: (a) np? > 0, (b) n? < 0. 


invariant under an internal O(n) symmetry group. The notation is such that @ 
stands for the column vector of these fields and symmetry breaking will occur if 
the bare-mass parameter 1” is negative. At the classical level the ground state is not 
described by a zero mean field value due to the fact that the potential 


2 
ve = 6? +2067) (11-45) 


has a minimum for a nonvanishing value of the field (Fig. 11-5) 


|g|=v= (- eye (11-46) 


To insure the stability of the theory we assume, of course, that large values of @ 
are damped, which means 4 > 0. It appears as if the vacuum is degenerate: every 
state with |p| =v is a priori a candidate. However, any choice leads to an in- 
equivalent Hilbert space. Note that such a choice of a ground-state value @, will 
leave an O(n — 1) subgroup unbroken, corresponding to those transformations 
which act as the identity on @,. 

To be specific, let us pick a coordinate system in internal space such that 
$, = <0| @ |0> has only its nth component nonvanishing, and parametrize the field 
in terms of a radial displacement p and (n — 1) orthogonal rotations as 


0 


0 
P= eb sr) | (11-47) 


i ee 6. 
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The t, with 1 <a<n-—1 stand for n—1 generators of the O(n) group acting 
effectively on the vector @,. In other words, the Lie algebra of O(n) contains in 
addition to the O(n — 1) generators the n — 1 f,’s. When expressed in terms of the 
new dynamical variables p and & the lagrangian reads 


2 
A 
L = H0p) + HOE? — Fw + pP —F(0 +p) (11-48) 


From this we see that the € correspond to (n — 1) decoupled massless fields, as 
predicted by Goldstone’s theorem. 

In the previous example, to be studied in more detail in Sec. 11-4, the ground- 
state degeneracy is apparent at the classical level. We have learnt, however, in Chap. 
9 that quantum corrections can alter the potential, leading to the possibility of 
vacuum degeneracy as a result of radiative corrections. This appears to be the case 
for the following model for scalar electrodynamics, as pointed out by Coleman and 
Weinberg. A complex massless boson field is coupled minimally to an electro- 
magnetic field. The lagrangian is 


A 
L= Log + HOG: — eAdry’ + 302 + eAgiy — qi (oi + $3)" (11-49) 


The complex field @ is written in terms of its two real components ¢, and @2. 
Moreover, the anharmonic coupling constant is denoted 4/4! to compare the one- 
loop computation of the effective potential with the expression given by Eq. (9-129) 
for the lagrangian 


A 
L = 4(09)? - an p* (11-50) 
which led to the result 
Be as 2) 11-51 
Yaar? + Gem?" MP 6 aoe 


In Chap. 9 renormalization was performed at a value of the field equal to M to 
insure that 

d* 
= do* 
Returning to scalar electrodynamics, we can repeat a similar calculation using the 
symmetry factors shown in Fig. 11-6 for the contribution of df to V. The result 


A Vib) |o=m (11-52) 


si 1 1 1 1 1 
1 
1 i et 1 1 1 1 1 
1 2 


(a) (5) (c) (d) 


Figure 11-6 Relative weights of the one-loop diagrams for the Green function with four ¢, fields. 
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reads 


ee $7+3\ 25 
uot + 081m ( on ) A 


> = ae ve 
— +( /. arr 
(11-53) 


Whee’ 


The calculation was carried out in the Landau gauge. The last diagram, Fig. 11-6d. gives a vanishing 
contribution at zero momentum. A factor three in diagram (c) originates in the trace of (g*, — k*ky:k*). 
The remaining weights arise from the vertices. 


It is consistent to assume that / and e* are comparable. Indeed, the 4 coupling 
is generated by the extra divergence occurring at order e*. Therefore we may 
neglect 4? as compared to e*. In this way the potential V of Eq. (11-53) has a 
minimum for a nonvanishing value @2 = $2; + 2.2 given by 


yi e? \2 e2 \2 ge 
ee (a aa = -54 
2-u(2)]oa() no 115 


If we choose the scale of @ such that M? = @?, that is, 


V 
A=— 11-55 
dp* | 92-62 ae 
we find the relation i 
e2 2 
Tabb (=) (11-56) 


giving indeed a value for A of order e*. The parameters of the theory are now 
e and ¢,. 


A similar reasoning would not apply to the pure o* theory of Eq. (11-50). It is true that the 
potential V has a minimum for a value @¢, such that 


A 11)? i? 2 


Ss 1 = - 
Selo G4? ae ee 
that is, 
g2 
41 v Ee ce b) ig 
din 7 + 0() (11-58) 


However, the quantity 4 In (#2/M7) is now sizable and we expect higher quantum corrections to be 
nonnegligible. Stated differently, it is not legitimate in this case to set 6? = M2, since this would 
imply a large value of 4 for which the perturbative series cannot be trusted. 


For scalar electrodynamics spontaneous symmetry breaking does not imply 
the appearance of any massless boson. On the contrary, both the vector and 
the scalar particle acquire masses given by 
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(11-59) 


= O(e*) (11-60) 


The reason for such a behavior will be studied in more detail in the following 
chapter. 


We conclude this section with a discussion of the role of the dimensionality of space-time. A 
theorem due to Mermin and Wagner states that a continuous symmetry can only be spontaneously 
broken in a dimension larger than two. For a discrete symmetry this lower critical dimensionality 
is one. This is, in fact, well known since in quantum mechanics with finitely many degrees of 
freedom (corresponding to one-dimensional field theory) tunneling between degenerate classical 
minima allows for a unique symmetric ground state. Alternatively, we may consider the discrete 
analog of a field theory, the simplest example being the Ising model in statistical mechanics. Path 
integrals are replaced by sums of terms of the form e~ **? where E is the energy of a configuration. 
For the Ising model E = —J Non o,0; where the sum runs over neighboring sites on a lattice and 
the discrete “spin” o; takes values of +1. Such a model admits a discrete symmetry corresponding 
to reversing all the spins o, + —o,. This symmetry is spontaneously broken in the low-temperature 
phase below a critical point in dimension two or higher, but no transition occurs in dimension one 
(Peierls, 1938). 

A similar model called the classical Heisenberg model replaces the variables o; by unit vectors 
S; on a sphere. In this case a continuous O(n) group operates if S; has n components, and no 
spontaneous magnetization occurs below dimension three. 

The Mermin-Wagner theorem has been restated by Coleman in the framework of field theory. 
We may establish this property by showing that the spontaneous breakdown of a continuous 
symmetry would lead to a Goldstone boson. But in a two-dimensional space-time it is not possible 
to construct a massless scalar field operator. Indeed, the corresponding two-point Wightman function 


re 


a?k 0 2) pik: 
0] b(x)4(0)|0> = | — O(K°)5(K?) e* 


co 1 
. | an cos (k!x!) eit'* (11-61) 
0 7 


is a meaningless infrared divergent integral. No subtraction procedure may be devised to circumvent 
this difficulty without spoiling the fundamental properties of field theory, for instance, positivity of the 
Hilbert space metric. A massless scalar field theory is undefined in a two-dimensional world due 
to severe infrared divergences. In the statistical language, fluctuations overcome energy in destroying 
long-range order in this dimension. . 

A simple argument reveals the nature of this phenomenon. Let us use a discrete classical 
Heisenberg model on a lattice. Compare the two configurations shown in Fig. 11-7 where the 
orientation of the “spin” is allowed to vary along the direction of the first axis, say. The action 
is replaced by the energy proportional respectively to E, = —[t and to E, = —17~' Yi cos (0/L). 
The relative weight of these configurations is given by the Boltzmann factor 


eB( Es — Ea) ~ eo bPE*p 
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Figure 11-7 Two configurations for the classical Heisenberg model on a d-dimensional lattice. One 
of the directions has been singled out to show the effect of a continuous rotation of the average spin. 


For d larger than 2 we see that the (b) configuration has a negligible weight in the thermodynamic 
limit and for sufficiently low temperature, meaning that order is favored. For d = 2 averaging over 
fluctuations will destroy this order. Similar arguments may be presented for a discrete symmetry 
showing that the lower critical dimension is then equal to one. 


11-3 CURRENT ALGEBRA 


11-3-1 Current Commutators 


Weak interactions have led to consider in detail the structure and properties of 
hadronic currents. Phenomenologically these interactions are well described by 
an effective current-current lagrangian of the form 


G 
2 = ——- J*(x)J}(x) (11-62) 
V2 
where G is Fermi’s constant equal to 
G = (1.026 + 0.001) x 10> °m; ? (11-63) 


The total current J ,,(x) is the sum of leptonic (/,,) and hadronic (h,,) contributions 
J u(x) = L(x) + h,(x) (11-64) 


Disregarding the contribution of the newly discovered massive lepton, the leptonic 
current involves only negative helicities for the electron e~, muon p, and neutrinos 
Vesa 


Ix) = Welx)7o(1 — 9s)Wo.00) + Uuldyo(l — ysWr,020) (11-65) 


The hadronic current itself combines strangeness conserving h'45=° and strange- 


ness changing h‘45~) parts as 
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h, = cos 8, h'aS=° + sin 6, hAS=) — (11-66) 
where 6, is the Cabibbo angle equal approximately to 
0, ~ 0.25 (11-67) 


‘Such a decomposition implies that we can relate the scale of the various com- 
ponents associated to transitions with different quantum numbers. Such a scale 
will be provided by the nonlinear algebra of current commutators. 

Each of these currents is a superposition V — A of a vector and an axial 
part, as was the leptonic current. In the framework of unitary symmetry, they 
belong to an octet of currents denoted V%, A? (a = 1,...,8) with 


hiss=0) = (V4 — 1,2) — (Ay! — iA,?) 
hints — Or = iV,,°) = (Ae a iA,°) 


The reader should not be confused by the notation A, used here for an axial 
current and its meaning as a potential in electrodynamics. Both are traditional. 

The current-current weak interaction, supplemented by these hypotheses, 
leads to remarkable universality properties. Consider, for instance, the matrix 
element measured in neutron f decay: 


G<p| hAS= n> = apy,(Gy — Gays)Un (11-69) 


The form factors G;-and G4 on the right-hand side are evaluated at zero momentum 
transfer due to the smallness of the neutron-proton mass difference. Within 
1 percent the observed value of Gy coincides with the corresponding quantity 
measured in muon decay. The value quoted in (11-63) is the one extracted from 
muon decay taking radiative corrections into account. The most precise measure- 
ments involving strongly interacting particles are usually performed in allowed 
B transitions (0° +07) among nuclear states. Care is also taken to correct for 
various effects, such as radiative corrections, Cabibbo angle, etc. Some recent 
values are 


(11-68) 


MQ 4N  — -G,/G = 1.006 
2641-»26Mg Gy /G = 1.011 


The fact that Gy(0) is not renormalized by strong interactions finds a natural 
interpretation if we assume with Gell-Mann and Feynman that the vector current 
V,, is conserved and in fact generates the hadronic unitary symmetry. In other 
words, the V,,* (a = 1,2, 3) are the components of the isospin current and Lo) 
is approximately conserved if we neglect SU(3) breaking. The hadronic electro- 
magnetic current is therefore 


7em 1 
i =e A Ve 


where for the purpose of comparison we omit the factor e in the definition of 
j°™. This hypothesis is a generalization of the electromagnetic case, where current 
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conservation implies a universal charge renormalization. We recall from Chaps. 7 
and 8 that the Ward identity expressing the conservation law implied that the 
charge renormalization was entirely due to vacuum polarization. All fermion 
interactions resulted only in the wave function renormalization constant Z2 = Z,. 


Therefore some components of the vector hadronic weak current and the isovector part of the 
electromagnetic current are members of the same multiplet and simultaneously conserved. This 
conserved vector current (CVC) hypothesis of Gell-Mann and Feynman may be checked by comparing 
the weak and electromagnetic decay rates of members of the same isospin multiplet. In particle 
physics we find a prediction for the n° >x° +e’ +vandx —+7” +e + # transitions. Due to the 
smallness of the momentum transfer, the amplitudes are directly normalized, through an isospin 
rotation, by the corresponding matrix elements of the electric charge. This leads to the rates 


5 
ie Cay) = Se —— = 0.45 s7} 


where terms of order m2/(m,+ — m,»)*? ~ 10 7 have been neglected. This 1s to be compared to the 
experimental value 


Waa ap = (0.39 SE 0.03) s 


Accurate tests can also be performed in nuclear physics. 


In contradistinction, the axial current does not appear to be conserved, even 
in the limit of an exact SU(3) invariance. Consequently, the axial coupling constant 
as measured in f decay is notequal to the vector one: 


< = 122+002 (11-70) 


V 


It is, however, conceivable to study a limit where axial currents would be conserved, 
at least the strangeness conserving ones. This would correspond to an additional 
chiral SU(2) x SU(2) invariance group, generated by the charges Q* and Q.° 
a— lao 

If the fundamental dynamical variables suitable for a description of hadrons 
include the quark fields, an explicit expression may be derived for the currents 


ne a 
Vi) = F015 A) AO) = ars a) UTD 


We may then obtain equal-time commutation relations for the currents as well 
as for the corresponding charges: 


O*(t) = [ex Vo"(x, t) 

(11-72) 
Q5(t)= [es Ao"(x, t) 

Gell-Mann has postulated that these commutation relations derived from the 


quark model remain valid independently of this assumption on the hadronic 
substructure. If SU(3) is not exactly valid some of the charges may depend on 
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time, while the equal-time algebra remains time independent: 
[0°%, O°] = ifere Q° 
[Q*, Qs"] = ifarcOs° (11-73) 
[Os*, Qs”] = ifancQ* 


We recognize that the commutations rules are those of the Lie algebra of the 
group SU(3) x SU(3). This is readily veritied by constructing the left- and right- 
handed combinations 


Q-" = 3(Q" $ Qs") (11-74) 
which fulfil 
LO" Q.°] aoa Ue eo 
[Q.*°,0-"]=0 


The ordinary unitary group is the diagonal subgroup of SU(3) x SU(3), and 
parity exchanges the two sets of charges 


P0,P* =Q- (11-76) 


The charge algebra gives its meaning to the universality concept since it is 
basically nonlinear. For instance, from 


[cleager Cue | — 203 (11-77) 


it follows that the matrix element of the left-hand side entering a weak amplitude 
is universally normalized with reference to the isospin. 

The relations (11-73) may be generalized in two steps. First we may write 
commutation relations between charges and currents expressing the behavior 
of V,° and A,“ under the SU(3) x SU(3) transformations. This abstracts from 
the quark model the fact that currents belong to a representation (1, 8) ® (8, 1): 


; [O°(t), Vab(x, 0) = ifare V(X, 8) 
[O%(t), Au’(x, t)] = fare Au‘(X, t) 

Osa (Xf) —ifa5-A, (x, t) 
[O57(t), A, °C, t)] = tifa V(X, t) 


From Eqs. (11-78) an integral over space allows us to recover the charge 
commutation relations. 

In a second step the quark model suggests writing equal-time commutators 
for the time components: 


[Vo"(x, t), Vor(y, t)] = ifare Vo"(x, t)6°(x — y) 
[Vo7(x, t), Ao’(y, t)| = i eon tX t)d*(x in: y) (11-79) 
[ Ao"(x, t), Ao"(y, t) | = Hake Vo"(x, t)d°*(x - y) 


(11-75) 


(11-78) 


530 QUANTUM FIELD THEORY 


These relations between time components seem safe. This is presumably not the 
case if we attempt a further generalization to include all the components. Terms 
involving derivatives of the 6 function are likely to appear. We would have, 
for instance, 


[Vatexet)) PPyst)) =a OP ixamy) + SavijQjO(X —¥) (1180) 


Such additional terms were originally introduced by Schwinger in his discussion 
of the conserved electromagnetic current { U(1) symmetry]. In this case let us 
assume that a relation of the type 


Lio(x, 0), jily, t)] = 0 (11-81) 


holds. No 6-function contribution appears on the right-hand side due to the 
vanishing of the structure constants in this abelian case. As a consequence 


Lio(x, 0), V- ily, )] =0 (11-82) 
Through current conservation this also means 
Lio(x, t), Gojoly, t)] = 0 (11-83) 


Taking the vacuum expectation value and inserting a complete set of eigenstates 
of the energy leads in the limit x > y to 


¥; Ex | <0| jo(0) |n>|? = 0 (11-84) 


From the positivity of energy we would conclude that jo vanishes! In this case 
at least, Schwinger terms are unavoidable. 

These extra contributions do not appear in the quantities [Q, j,] obtained 
by integration. The situation is even more intricate when dealing with equal- 
time commutators of space components which are strongly model dependent, 
and will not be used in the following. 

We have already encountered Schwinger terms in Sec. 5-1-7 when we tried 
to write spectral representations for the vacuum expectation value of the time- 
ordered product or the commutator of electromagnetic currents. We observed 
a relation between the Schwinger term and the local noncovariant difference 
between the naive time-ordered product (T) and its covariant conserved version 
(T). We found that 


<O| Ti,(xdi(y)|O> = <O| Tjulx)j(v)]0> — i(Guv — GuoGvo)O*(x — y)~E (11-85) 


with & an integral over the spectral function 


2 
é= {ane aoe (11-86) 


On the other hand, it was also shown that 
<O| Ljo(x, t), iy, t)] |O> = i0,.63(x — y)é (11-87) 


If we assume such relations between Schwinger terms and noncovariant parts 
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of the naive time-ordered product (seagull terms) for arbitrary currents and 
States, these undesirable contributions cancel in the applications of current 
algebra. Indeed, a typical result follows from a Ward identity satisfied by the 
covariant time-ordered product 


oar <A| Tj*(x)j"(y) |BD = <A] TA, 4(x)j"(y) |BD 
+ 8(x° — yA] [j%(x), 7'(y)] |B> or =; (seagull (11-88) 


If the last term is compensated by the Schwinger term arising from the equal- 
time commutator, both may be dropped and the identity reduces to the naive 
one: 


0 
aan SAl TI4(30j"()|BD = <A| T0,5"()7")|BD 


re — 9 Aj PF (FOB? |nsive (11-89) 
This will be taken for granted. The Ward identity will be used to obtain low- 
energy theorems and sum rules whenever some information on the current 
divergence is available. For axial currents we may postulate a partial conservation 
law (PCAC), the precise form of which will be stated below. Let us first, however, 
show how local commutation rules may be tested. This will enable us to introduce 
a new device, the so-called infinite momentum frame of Dirac, Fubini, and Furlan. 
Choose nucleon states of equal momentum p for A and B and average over 
their polarization, thereby defining 


1 me | . 
W,,(p, 4) = z| ae? ~ 5 >> <p] Lit(x), 78(0)] |p> (11-90) 
pol 
An integration over q° generates the commutator 


. | ea® WoolP, 4) = [axe iq-x — 5 lLisex, 0), 73(0)] |p> (11-91) 


2 al 


In typical ped the currents would be chosen as vector densities 
i3° = Vo *” or Vor*'°, with commutators equal to combinations of Vo and Ve. 
For cates 


[exewns ¥ <pl [Vat *!2(x, 0), Ve (0)] [p> 


pol 
=4| ax 50) ¥ <a 130) (11-92) 
Py I 
Agi 


where T? is the third component of the nucleon isospin and states have been 
normalized according to <p| p'> = (po/m)(27)*5°(p — p’). 
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The covariant amplitude W,, is decomposed according to 


PuPy_. PpGe 
Wis = Wi9 uv SP W, 2 = iWseuvpo°5 22 
Guy Puy + Pvdp W, Pudv — Pvdu 11-93 
+ Wa lle ie + 1W6 am2 ( ) 
Here W,,...,We are functions of the Lorentz invariants q* and v = q-p. Con- 
sequently, 
—-W, W. é 
| aa? WoolP, 4) = Po | o( ee. Oe lo Ws) (11-94) 
Po m Pom Pom fixed q 


If we select a frame where p-q=0, and therefore such that go = v/po and 
q’ = (v/po)* — q?, we may rewrite (11-94) as 


—-W, W. 2W, vW. 
| ea? Wostn.0)= po | av ( a 2 siege ;) (11-95) 
fixed q 


ma 
Ame Fe) 
Po m Pom Pom 


The interpretation of the sum rule is still complicated because of its dependence 
ON po. Instead of choosing the nucleon rest frame, where po = m, it is more 
clever to use a limiting one obtained by letting po go to infinity with gq? held 
fixed: 


Do.|p| co 830 p- q= 0 


(11-96) 
gq’? —q’ fixed 
If we are allowed to interchange limit and integration we find that 
1 | 
= [a W,(v, q? = —q’) = lim — | aa? Woo (11-97) 
m porta Po 


This is a severe constraint in view of (11-91) and (11-92), since in particular 
it implies that the left-hand side is independent of gq’. It tests the local character 
of current algebra. 

There are several applications of this type of sum rule. We quote only a few 
of them. The Cabibbo-Radicati sum rule is obtained when applying Eq. (11-97) 
to the isospin vector currents V'*'*, V'~'?. It reads 


dFY F5(0) ?? ey ae” ; 
Z ig (0) = a J or | me (20%)2 — 63/2) (11-98) 


2 
2m 2n om Vthresh 


The notation F{(q’), F3(q*) stands for the isovector nucleon form factors, We, 
such that the matrix element of the electromagnetic current reads 


: " FR + FY rt Temes 6 en 4 
<pal #0) |p.) = ia v(t) ee a( _ a (11-99) 


and 1/2, 3) are the total hadronic cross sections for the scattering of “isovector” 
photons off nucleons in the channels of total isospin 4 or } respectively. The 
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Figure 11-8 Inclusive process y+ N>1+ X,. 


experimental values for the two sides of the sum rule are in good agreement. 
In the unit of the inverse square pion mass one finds 


dFY FS(0) 20 
2 (0) 0.132z 2 Ea ae =| a (261)2 — 03/2) = 0.126m;7 2 


~ dq? 1 v 
(11-100) 


Vthresh 
The inclusive cross section for the process neutrino (v) or antineutrino 
(v) + nucleon {N)— lepton (/) + X. where X is an unidentified hadronic state 
(Fig. 11-8), is expressed in terms of the structure functions W;, W, W3 appearing 
in the expression (11-93) as 
do”) Gok: i. 


ae E+E .,8 
— = 2WY™ sin? — + Wi” cos? —F Wh) —" — sin? — 
alqay law E ( me 7 ie aia 2 


2m 


(11-101) 


The kinematical variables v and q’ are related to the incident neutrino energy 
E, final lepton energy E’, and laboratory scattering angle 6 through 


nage 2) (11-102) 


7] 
q* = —4EE’ sin’ 


The lepton mass has been neglected for high-energy experiments. This explains 
the disappearance of W,, Ws, We in (11-101) and the simple expression for the 
momentum transfer. Such processes will be studied in more detail in Chap. 13. 

For fixed qg* and y, when E (hence E’) grows, 6 tends to zero and the cross 
section may be approximated by the W, contribution alone. Therefore 


iG Ge)? 
lim | 


|| =, dv WS(v, q?) (11-103) 
E7-a@ 


— 92/2 
Crossing symmetry implies 

W (v, q?) = —W(—v, q?) (11-104) 
Current algebra then allows us to write the Adler sum rule 


do™ o) =|. ( 2) ee 
pe = cos 0; + | I, + — | sim’ 0, (11-105) 
Po Crate ical — 
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with 73 the third component of isospin, Y the hypercharge on the target, and 
6. the Cabibbo angle. An analogous result applies to the inclusive inelastic electron 
scattering e + Ne + X using the vector current V'*'*. Since current algebra 
does not apply to the isoscalar part of the electromagnetic current the prediction 
obtained by Bjorken is only an inequality for the sum over neutron and proton 
targets: 


do” doa? 2na? 
; a eae 11-106 
‘ia ata ise 


11-3-2 Approximate Conservation of the Axial Current and Chiral 
Symmetry 


An SU(2) x SU(2) Lie algebra is generated by the commutation rules of vector 
and axial charges. Assuming that axial currents are approximately conserved, the 
resulting symmetry is called chiral symmetry. It is implemented in the Goldstone 
mode with the pions as massless particles. This hypothesis enables us to deduce 
new consequences from current algebra in the form of sum rules and low-energy 
theorems. The generalization to SU(3) x SU(3) including strangeness changing 
currents appears more questionable. 

Consider the axial current matrix element between a pion state and the 
vacuum ~ 


<O| Ai(x) |x*(p)> = ip, o* f, e7 ?** (11-107) 
This amplitude determines the rate of the decay z > yp v as 


G?m2 f2(m2 — m2)? 
eed ~ al a (11-108) 


4nm3 


Experimentally, f, is measured as 


fn =~ 93 MeV (11-109) 
From (11-107) the matrix element of 0”A,(x) is 
<0] 0*A}(x) | a*(p)> = m2 f, oP -* (11-110) 


Therefore current conservation implies f,m2 = 0. We have the choice between 
two possibilities. Either f, or m2 is zero, in contradiction to experimental facts. 
Nevertheless, let us boldly imagine a world with massless pions as a tentative 
approximation. If this SU(2) x SU(2) group is implemented with an invariant 
vacuum we are led to an unrealistic world with parity degenerate multiplets and 
corresponding selection rules. On the other hand, a Goldstone realization is in 
agreement with the fact that pions have a mass much smaller than other mesons. 

Let us therefore explore the consequences of setting 6"A, =0 and m2 =0. 
Compute the axial current matrix element between nucleon states 


<N(p2)| Ak) |N(p1)> = a(p2) = Lyuysga(q?) + duysha(q?)]u(p,) (11-111) 
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J 
At 
w | 
= 
Figure 11-9 Pion contribution to the axial current matrix element 
N (p,) N (Pp) between nucleon states. 


with gq = p2 — p;. Comparing with (11-69) we find 
@4(0) = G4/Gy = 1.22 (11-112) 
Current conservation implies 
2mga(q°) + q’ha(q?) = 0 (11-113) 


from which it is not wise to conclude that g4(0) = 0. Indeed, the form factor 
h(q~) has a pole at q* =0 corresponding to pion exchange (Fig. 11-9). The 
corresponding contribution reads 


ifn a igawnil(p2)yst/u(p1) (11-114) 


with y,vy the pion-nucleon effective coupling constant (Sec. 5-3-4) approximately 
given by 
J2Nnn 


4n 


From (11-113) and (11-114) we obtain at zero transfer the Goldberger-Treiman 
relation 


= 14.6 — (11-115) 


“ 


ga(0) = 24 = QaNN us (11-116) 

Gy m 
in agreement with the experimental data, within a 10 percent error [the right- 
hand side of (11-116) is 1.34 to be compared with 1.22]. This is a quite remarkable 
result since it relates quantities from strong (gzww) and weak (f;, Ga/Gy) inter- 
actions. In the real world with massive pions we would like to have some control 
over the extrapolation from the value q? = 0 to q* = m2. This is achieved by 
assuming a partial conservation law (PCAC) 


OM Ad (x) = m2 f,n4(x) (11-117) 


which identifies the divergence of the current with a smooth interpolating pion 
field. We know already that 0“A, has the correct quantum numbers. From 
Eq. (11-110) it may indeed be used to generate the asymptotic pion states if 
m2 f, # 0. Equation (11-117) is supplemented by an hypothesis of a smooth 
extrapolation of form factors beyond the mass shell. This means that matrix 
elements of 6“A, are dominated by the pion pole for small values of the transfer 
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2 < 25 
momentum |q?| < mz: 


€ 
D(q’) = (A|0"A,|BD = a + (11-118) 


where C is a residue to be determined. This is what is meant in practice by 
partial conservation of the axial current. 


11-3-3  Low-Energy Theorems and Sum Rules 


With current algebra and PCAC, we are in a position to derive low-energy 
theorems. The situation is analogous to the one prevailing in electrodynamics. 
We shall therefore begin with a study of low-energy Compton scattering. 

To simplify we write the amplitude for photon scattering on a spinless target 
of unit charge (Fig. 11-10): 


(22)*6*(p1 + ki — po — ka)F = ie? [ats d*y abe, ererE’ ps | T juleodiul) | Bi 


TF = ie? [as efey e2°* (po| Tj,(x)j(0)|pi> (11-119) 
= e7ehe} Tuy 


Choosing for convenience the polarization vectors such that ¢;:k,; = €2°k2 and 
taking Lorentz and time-reversal invariances into account, we write the general 
form of T,,, as 


Tie Agwa BP,P, + C(Pyky, + Biko.) + Dk. ke. (11-120) 


Here P stands for the average target momentum, P = (p, + p2)/2, and the scalar 
amplitudes A, B, C, and D are free of kinematical singularities. From current 
conservation it follows that k}T,,, = 0. Therefore 


A+ P:k,C +k,:k,.D=0 
P-k,.B+k,-k,C=0 


In the low-energy limit k,;, k2 > 0, the amplitudes B, C, and D are dominated 
by their dynamical pole terms given by Born terms with renormalized residues. 


(11-121) 


isin Ga Ky, € 


1 1 


Figure 11-10 Compton amplitude and Born terms. 
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An elementary computation leads to the value of the amplitude A at threshold 


hime 2 (12122) 

ki,k270 
In this limit, but without any perturbative approximation, we therefore find for 
the cross section 

do «w 

lim — = - 5(é,°6,)* - 
aca Tee (€1 ° €2) (11-123) 

in agreement with the classical evaluation of Chap. | and the lowest-order 
calculation (Sec. 5-2-1). Low, Gell-Mann, and Goldberger who first developed 
these arguments obtained from the same hypothesis the next term linear in ky 
and k, for spin + targets. The amplitude 7 for forward scattering is written 
(with w the common laboratory energy w = p-k/m) 


T = fx(w?) 8; + 8) + iafp(w)(e x &,)- 6 (11-124) 


The polarizations are now taken to have vanishing time components. The low- 
energy theorem then states 


: (11-125) 
flO) = ~ = (g — 2) 


The amplitude f, involves the anomalous part of the magnetic moment of the 
target (g — 2 = 3.58 for the proton). The first relation is of course equivalent 
to (11-122). This prediction is difficult to test directly. It is better to transform 
it into a sum rule as suggested by Drell and Hearn. We use an unsubtracted 
dispersion relation for f:(@?) [the corresponding one for f,(w*) would require 
at least one subtraction] to write 


nim f(o?) _ 1 | do? Ce = 80) 


] ee) 
2 —— 
Jie) = = \, dw Ae ae Fr ty 


Here co .(@7) refer to the total cross section for a circularly polarized photon with 
its spin parallel or antiparallel to the spin of the target. The t-channel exchange 
implied by the helicity flip gives credence to the fact that the integral over the 
difference of cross sections stands a good chance of converging, hence justifying 
our assumption of an unsubtracted dispersion relation. From this relation we 
derive that 


2m? oO 
For protons the left-hand side is equal to 205 yb while the right-hand side is 


between 200 and 270 yb according to the treatment of high-energy data. . 
Let us apply similar techniques to amplitudes involving axial currents. This 


ont'(g — 2)° _ | a Oy (11-126) 
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will give us some information on low-energy pion-nucleon scattering. We intro- 


duce the axial vector matrix element between initial nucleon and final pion- 
nucleon states: 


(22)*6*(p1 + qi — p2 — 42) T* = [ass e !°* (74(qo)N(p2) | An) | N (p1)> 


(11-127) 


According to (11-117) this is related to the pion-nucleon amplitude 7,y. To 
see this we contract both sides of (11-127) with q, and get 


(27)*6*(p1 + qi — po — q2)qtT* = -i [a e791" * (4g) N(p2) | O#AK(x) | N(p1)> 


m2 | 
=—-i — f, | de tu 
mz — qi I 
x (CO, + m2)<m4(q2)N(p2) | 2*(x)| N(p1)> 
Hence 
m2 
qi Tt = —ifz 3 - TF Py (D2; 423 Prs 41) (11-128) 
mo Y1 


When q; goes to zero the left-hand side vanishes unless Tj* has a singularity. 
The only one arises from the nucleon pole (Fig. 11-11). But contrary to what 
happened in the Compton case this singularity is compensated by the vanishing 
of the numerator and qT* ~ qi/qi°p1 > 0 when q; > 0. 
This leads to the Adler compatibility condition for the extrapolated amplitude 
lim 7 fh(p2, 423 pi, 41) = 0 (11-129) 
qi70 
This soft pion limit is of course unphysical. An alternative derivation assumes 
directly m, = 0 and 0,A" = 0. From qT" = 0 we find (11-129) by isolating the 
pion pole from other singularities. 
If we stick to this slightly unrealistic world with zero-mass pions, we can 
write a Ward identity similar to (11-89) for the amplitude 


Ts = | atx e4’* <H(p2)| TAL(x) A¥(0)| H(p1)> (11-130) 


with H an arbitrary hadronic state. 


7 
7 


4 . 
Figure 11-11 Nucleon pole contribution to the matrix element 
N(@,) N(@®)  <nN|A|N). 
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It follows that 


Gant 


i [as et * CH (p2)|5(x°)[ Abo), A¥(0)] | H(p1)> 


_ _ itl | d*x e'°* 54(x)H(p2)| V0)| H(p,)) 


= —e*"(p} + p3) Th (11-131) 
We have used current algebra and the fact that V,; is the isospin current, so 


that Tj) stands for the isospin of the hadron. When P1 = P2, T*’ has a double 


pole at q? = 0, with a residue proportional to the 7H scattering amplitude at 
threshold 


(q°) 
Consequently, the threshold value of 7 x (in the real world when p:q = m,my) 
is given by 
m,MNy_ 


or 
TF nH — G) 
threshold i Tt 


<2T;,°T,> (11-133) 


If T stands for the total isospin in the s channel we have the relation 

(29 n° T,) = T(T + 1) — Ta Te + 1 —2 (11-134) 
This result may be reexpressed in terms of the s-wave scattering lengths in the 
various isospin channels defined as 


Tin 


= 8n(my + mz)ar (11-135) 
threshold 


For pion-nucleon scattering this yields the values 
1/2 = 0.166m;z! a3;2 = —0.083m;} 1/2 + 2a3;2 = 0 (11-136) 

in surprising agreement with the measured values 
aoe = (0.171 + 0.005)mi;, * axe = —(0.088 + 0.004)m,' (11-137) 


3/2 
Similar successful calculations may be performed for pion-pion or pion-kaon 
scattering. 
We can also reexpress these low-energy theorems in the form of sum rules 
using dispersion relations. We split the pion-nucleon amplitude in even or odd 
parts under crossing (Sec. 5-3-4): 


Tm F* GR igh GT - (11-138) 


and use the variables v = p-q and transfer t. Phenomenological considerations 
predict the behavior of these amplitudes for large v at t = 0: 


F ~~ FO yy? 


vo ae 6) 
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Therefore it is likely that 7 ~(v, 0)/v satisfies an unsubtracted dispersion relation. 
The sum rule derived below will then provide a consistency check of this 
assumption. With the pole contribution of the nucleon intermediate state at 
2v + mz = 0, the forward dispersion relation reads 

7190) mada 2)’ _ a7 () 


dy ail (11-139 
v y2 — y? ) 


v a y> — mi/4 1 


mMy 


From (11-133) we obtain the value at threshold (neglecting mz as compared to 
mx in the Born term denominator): 


2 ea) 2% 
ee ee Me a a (11-140) 
‘i my Te Jmyny v Nn 


Finally, we express Im 7 in terms of the total cross sections with the help of 
the optical theorem and use the Goldberger-Treiman relation to substitute 
G,/Gy for fZ. This leads to the Adler-Weisberger sum rule 


7 2 fo 6) 
= a = =e | ; S o**(y) — 6% P(v)] (11-141) 
The experimental agreement is again very convincing. A numerical evaluation 
leads to a value of G4/Gy between 1.16 and 1.24 to be compared to the f-decay 
value of 1.22 + 0.02. 

There is an impressive list of applications of current algebra and soft pions 
techniques to weak semileptonic or nonleptonic decays for which the reader is 
referred to the literature. 


In conclusion, chiral symmetry is a good approximation in hadronic physics. An extension to a 
SU(3) x SU(3) group is not reliable as shown by the large K and » masses. Gell-Mann, Oakes, 
and Renner have given a phenomenological description of the SU(3) x SU(3) breaking by writing 
an effective hamiltonian for strong interactions in the form 


The term H, responsible for SU(3) x SU(3) breaking would be of the form 
Hy, = Equo + Eghig (11-143) 


with uo transforming as a scalar under SU(3) and wg as the eighth component of an octet. A 


phenomenological analysis shows that &g/éo 1s close to re rather than zero. This indicates that 
SU(2) x SU(2) rather than $U(3) would be a better approximate symmetry. 


11-4 THE o MODEL 


We shall now describe a field theoretic model originally introduced by Gell-Mann 
and Lévy (1960) as an example realizing chiral symmetry and partial conservation 
of the axial current. The name o model originates in one of their notations. We 
take this opportunity to study the interplay of renormalization and symmetries. 
Following the ideas of Lee and Symanzik this enables us to introduce the 
machinery of Ward identities which will be useful in the study of gauge fields. 
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11-4-1 Description of the Model 


The o model involves a fermionic isodoublet field w of zero bare mass, a triplet 
of pseudoscalar pions, and a scalar field o. The corresponding lagrangian is 
written 


FL=F$s5+ca 
Ls = wid + g(o + in: tys)|W + 4[(On)* + (60)7] (11-144) 


2 
- A 
aa os nt”) sara ag n*)* 


It is usually referred to as the linear model for a reason to be discussed later on. 
The part /s (the index S is for symmetric) is invariant under an SU(2) x SU(2) 
chiral group acting as follows. The right and left combinations Wr = (1 + y5)W, 
Ww, = 3(1 — ys) transform respectively according to the representations (4,0) and 
(0, 4) while the set (c, 2) belongs to the (4, 3) representation. 
To see this, we rewrite the coupling term 


Wo + ins tys)\W = Wr(o + in tet Wao — inet) 


If (U,V) stands for an element of SU(2) x SU(2) with U and V varying in- 
dependently, the transformation (o + in: t)— V(o + in: t)U ' is an allowed one, 
leading to real o’ and a’ fields. Therefore if we perform simultaneously the 
isospinor rotations Wr UWr, Wr > Vw, the interaction term is obviously in- 
variant. So is the quantity o” + x? proportional to the determinant of (o + iz: 1). 
Finally, the kinetic term Wi@w is equal to WridWp + Wid, showing the invariance 
of the full lagrangian. 

The corresponding infinitesimal variations are generated by the chiral charges 
O% 1 With « running from one to three: 


[Ok We] = —2t°Wr (Qt, Wi] = —at Wr 
ie — 0) t. = 0 
[O%, A [OF Wr] (11-145) 
i i 
[O%, 0] = 5 2" [Qi,0]=—5" 
i i ee cae 
[Ok 27] = — 5 6% + 5 era? Omer a= = So + 58 a 
A compact notation for the (o, 2) transformations is 
[0% (6 + in-)] = (0 + in 0) 
(11-146) 


[Q%,(o + in-t)] = -Lotin-® 


a form which makes the invariance properties of the lagrangian easy to check. 
The Lie algebra of SU(2) x SU(2) is isomorphic to the one of its factor group 
O(4) = SU(2) x SU(2)/Z2 and (6, m) transforms as a vector under O(4). 
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In the absence of the breaking term co, the vector and axial currents are 
conserved. They read 


Vi = Von W + etna! 
(11-147) 
= t* . 
As = Wyss W + (08,n* — n*0,0) 


The breaking term co leaves the diagonal SU(2) group unbroken. The vector 
current remains conserved and the axial current, the expression of which is not 
modified, acquires a nonvanishing divergence 


QM At = — cn" (11-148) 


This model therefore incorporates all the desirable features and the divergence 
of the axial current appears naturally as proportional to the pion field. 

The linear breaking term implies that the quantum a field has a nonvanishing 
vacuum expectation value <0|a|0> = v, meaning that a perturbation expansion 
must be performed taking into account the fluctuations of this field around the 
value v instead of zero. This is achieved by shifting the field as 


o =ao-—D 


and requiring that o’ has zero vacuum expectation value. Reexpressed in terms 
of a’ the complete lagrangian reads 


L = Wid + gv + g(o' + in: tys) |W + 4[(6n)? + (00')7] 
—4(u? + 3Av*)o'? — $(u? + Av?)n? — Avo'(o'* + 27) 
A 
=a (x? + 0’)? + o'(e — pv — Av) (11-149) 


The effect of the translation has been threefold. The mass degeneracy of meson 
fields has disappeared. On inspection these masses now read 


mz = uw? + Av? 
een? ait (11-150) 
Furthermore, the fermion has acquired a mass equal to 
My = —gv (11-151) 


Finally we find a new trilinear coupling o’nz. 

The vacuum expectation value v will be constrained to satisfy a complicated 
condition <o'> = 0. The best which can be done is to implement it perturbatively 
by requiring that tadpole diagrams for the transition o’+ vacuum vanish (Fig. 
11-12). The Born approximation to this condition is, from (11-149), 


c— wv—Av=0 
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Figure 11-12 Tadpole amplitude. 


If we return to Eq. (11-117) we are led to the following identification: 


frmz = —c = —v(u? + Av?) = —vm2 
Therefore, 
La 
(11-152) 
my = Ofx 


which is the Goldberger-Treiman relation to this order where G4/Gy = 1. 

When c goes to zero two distinct situations may arise. One possibility is 
that v also goes to zero, in which case we find the normal mode of symmetry 
described above with massless nucleons. An alternative possibility arises when 
yw? <0, and the limit corresponds to 


i ea (11-153) 


This is the Goldstone phenomenon studied in Sec. 11-2-2, with a vanishing pion 
mass 


m2 = p? + Av? =0 


In this phase the o model may be used to derive the low-energy theorems for 
pion-pion or pion-nucleon scattering. 


11-4-2 Renormalization 


The lagrangian Y(o', x, w) obtained above after translation of the a field, or its 
version in the limit c = 0 corresponding to the Goldstone mode, is renormalizable 
in the sense of power counting. All the monomials in the interaction lagrangian 
are of dimension smaller or equal to four ; the same is true of the possible counter- 
terms. It remains to show that the lagrangian plus its counterterms has a similar 
form as in (11-149), a remnant of the original structure (11-144). In particular, 
what will be the fate of the PCAC relation (11-148)? We shall show that these 
properties will be preserved by performing the renormalization in the symmetric 
normal phase and proving that this is sufficient to treat the cases of explicit 
(c ¥ 0) or spontaneous (c = 0, w? < 0) symmetry breaking. 

To simplify matters we omit the fermion fields. A complete treatment does 
not reveal any new difficulty. We also use compact notations with @ a multiplet 
of n fields transforming according to the vector representation of a symmetry 
group O(n). In the previous instance n was equal to four. We write the lagrangian 
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‘Spacmiliitemes 
aameaaii (11-154) 


5 ; 
ieee 2 » alana 
Ls = 3(09) D ? me 


£5 1s therefore invariant under the transformations 
G>b+66 bb; = O@aT ide (11-155) 


where the Tj are the representatives of the infinitesimal generators of the group, 
in the present case n by n antisymmetric real matrices Tj, + 7,5 = 0. 

It is convenient first to regularize the theory in an invariant manner. This 
may be done, for instance, by modifying the kinetic term into the form 


2 
fers (06) = [ats oie [es ont +a = +b ol + 6 


leading to a propagator with a behavior smooth enough at large momentum to 
insure the convergence of all Feynman integrals. This regularization will be under- 
stood in the sequel without being explicitly written out. 

Consider now the generating functional for connected Green functions in the 
symmetric theory 


es) = | 20) exp | {ar [Ls(o) + 5° 61| (11-156) 


As a consequence of the invariance of the (regularized) lagrangian under the 
transformations (11-155), Gs(j) satisfies 


Gs(j) = Gs[j + 6a(T)j)] (11-157a) 
or, equivalently, 
; dGs(j) 
x Tg = = - 
| >, & Jil) kl Sfx) 0 (11 157b) 


In order to exhibit the structure of the divergences we need a similar identity 
for irreducible Green functions obtained after the Legendre transformation 


i's() +i [as J: ¢% = Gs(j) 
(11-158) 


Since conversely 
_ s() 
Og,(x) 


we derive that I’s(@) enjoys the same invariance properties under the transforma- 
tions (11-155): 


idx) = 
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4 a oI s() ae 
{a X hx(X) Thi 5b 0 (11-159) 
Consequently all counterterms required by renormalization have to be symmetric, 
provided that the normalization conditions are. Indeed, assume the property true 
- up to the L-loop order. Since “s was the most general symmetric polynomial 
of degree four, the effect of counterterms could only renormalize the mass u and 
the coupling constant 4, and affect multiplicatively (all components of) the field 
by the wave-function renormalization factor Z. Since Zs + ALY enjoys the 
same invariance as #s did, it follows that T computed with it up to order 
(L + 1) fulfils the same constraint (11-159). Its divergent part of order (L + 1) 
is the local symmetric polynomial generating the counterterms of order (L + 1). 
This inductive proof appears pedantic here. In more elaborate cases the param- 
etrization of the transformation may be modified order by order by renormaliza- 
tion, but the framework described above remains useful. 
Consider now the complete lagrangian ¥ including the linear breaking term 
c-@. A translation on @: 


@=P%+v <“P=V (11-160) 


is necessary in order to generate the correct perturbation theory. Omitting the 
prime, we have therefore 


_ Jat) exp {i J d*x[Pslo + v) + o°G + OF} 
J D(p) exp {i J d*x [Ls(o + v) + o-e}} 


where the denominator insures that G(0) = 0. Since @ is a dummy variable we 
may undo the translation 


§ Zo) exp {i J d*x [Ys(o) + @— Y)-G+9]} 


(11-161) 


Gi) = (11-162) 
“  Fa(d) exp {i fax [Ls(b) + (@ — V)-e]} 
and express G(j) in terms of the symmetric functional as 
G(j) = Gs(j + c) — Gs(c) —i [atx jv (11-163) 


where v is determined by the condition that the derivative of G(j) with respect to 
j vanishes at j = 0: 
_ 6Gs(j) 


= (11-164) 
Me = F5i(x) 


jaye 


Similarly, 
iT (p) = Gi) —i [ee “o 


= Ga) + ©) — Gait) —t [ates (@ + v) (11-165) 
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6G(j) dGs(j + ¢) 

= peialy 3. COST Sn (11-166) 
wie Cee amy 
Thus 


T(¢) = Ps(@ + v) — Ts(v) + [ats c+ P(x) 


(11-167) 
= ol s(v) 
Oe Ov, 
The identity (11-159) expressed in terms of I'(@) states that 
or (¢) 


[ex [ biAx) iF vx] wil —=iGi | = 0) (11-168) 


Let us look in more detail at the content of (11-168). For instance, take one 
derivative with respect to @ and set @ = 0: 
Srp) | 


dT (¢) | ; 
Ti ~ + 0m Tin > —_____—__— 
aha “| $=0 are Soba ObUN) | p=0 


Taking into account that 61° (@)/d¢,(x) — = 0, which follows from the definition 
of v, and the antisymmetry of the matrices 7, this means the following relation 
for the inverse propagator I\?)(p”) at zero momentum: 


Te [ c1O mx ata v1 2(0)] =0 (1 1-170) 


=0 (11-1) 


This shows that ¢ and v are collinear, a fact already clear from Eq. (11-167). 
If we think of c as an external magnetic field then the magnetization v is along c. 
A mass parameter for the transverse states (with respect to c) is defined according 
to 


T?(0) = —m? (11-171) 
and c= mrVv (11-172) 


The quantity m7 plays the role of a pion mass in the o model and generalizes a 
result obtained in Sec. 11-4-1. From (11-167), it follows that once the symmetric 
theory is renormalized so is the broken one. Explicitly, if 


Tsa(@, 17, A) = Vs repg(Z 1/2, uw? + Sp, Ao) (11-173) 
the corresponding renormalized functional for the broken symmetry case is 
Pa(@, HW, A, €) = Tyeg(Z 7/7, uw? + 5p?, Ao, Z~ 7c) (11-174) 
A rescaling of the symmetry breaking parameter c has been necessary: 
Cro 2 6 (11-175) 


in such a way that c- @ remains invariant, ¢- @ = cg - do, and insures that 
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Vo = Zil2y 
OV sr(v) 
Ck Sy, (11-176) 
ces > OTs reg Vo) 
OVox 


Equation (11-167) implies that the amplitudes for the case with broken symmetry are obtained by 
resumming tadpole insertions in the symmetric theory. For instance, the p-point Green functions 
(p > 1) may be written as (Fig. 11-13) 


PE: lees sph } a fen OG eve lire mer eateen dX is,-->Xp Vises Ye) (11-177) 


For compactness we have assumed v along the first axis in isotopic space. 

This discussion of renormalization may also be carried out when the breaking terms are more 
complex and involve higher-dimension operators. The lesson to be drawn from the work of Symanzik 
is that breaking terms of dimension w <4 only require counterterms of dimension lower or 
equal to w. For instance, a mass-breaking term (w = 2) will not affect the counterterms of degree three 
or four which will remain symmetric. A soft breaking (w < 4) leaves a remnant of the initial symmetry. 
A hard breaking (w = 4) will a priori destroy completely the symmetry. 

These properties on the ultraviolet divergences have their counterpart on the asymptotic behavior 
at large momenta of renormalized Green functions, at least in the euclidean region. This is another 
aspect of Weinberg’s theorem (Sec. 8-3-2). Thus a soft breaking will not affect the asymptotic regime 
which will remain symmetric. 

The reader may wonder what is the link between this discussion and Ward identities 


eas): A> 
Ox 


“ 


== (TO"fi(x)A1(x1)-** An(Xn)> + e €T A4(x1)*** 5(x° — xP) [j6(x), Ap(Xp)]*°* An(%n)? (11-178) 


encountered in quantum electrodynamics or in the applications of current algebra. It is easy to 
convince oneself that the former are relations of the kind of (11- all integrated over x. The fields 

A,(xp) are identified with , (xp) and j is the Noether current ji(x = [0,dx(x)] Té[OAx) + vj. Upon 
integration the left-hand side vanishes as a total derivative — boundary contribution in the 
absence of massless states. On the right-hand side @“j2(x) is equal to c) Tit [@m(x) + Un] and the last 
term involves the variation of the field 


5(x° — y°)[j5(x), de(y)] = 64(x — y) TAL Oy) + 21] 
We therefore obtain the identity (11-168). 


; Xqs!y 
X2,!2 


Figure 11-13 Summing tadpole diagrams. 
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We let the reader derive (11-178) from the functional integral (11-162) by observing that it is not 
affected by a change of variable 6 > @ + 6, 66 = (T%@)dw,(x) with an x-dependent 6a,(x). This 
dependence is responsible for a variation 6 %s(p) — j+ 0(6c) according to the definition of the current. 
A derivative with respect to dw will yield Eq. (11-178). The renormalization of this identity may 
first be studied in the symmetric theory to show that the currents are not renormalized or, in other 
words, that Z;= 1. Moreover, this property is not affected by a soft breaking term so that the 
multiplicative renormalization constant for the current remains equal to one. Are there new problems 
when we try to derive identities with several currents? 


We are now in a position to discuss the interesting case with spontaneous 
symmetry breaking. At first sight it would seem reasonable to start with the 
renormalized symmetric theory with u? > Oand then continue to the region p? < 0. 
The trouble is that this procedure involves a transition through a singular point. 
Physical quantities such that m#(,2, c) will not be analytic in u* for c = 0. To cope 
with this difficulty we may introduce a small linear breaking term (c 4 0) to turn 
around the singularity. In Fig. 11-14 we show how the Goldstone phase may be 
reached along the path «By by successively varying c and y”. From the previous 
analysis it follows that counterterms pertaining to the symmetric theory (point x) 


Figure 11-14 The behavior of mz and f, = —v as functions of y? in the absence (solid line) and in 
the presence (broken line) of an external field c. 
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will insure the finiteness at the broken symmetry point £. Symmetric mass counter- 
terms will lead from f to y corresponding to the variation in ,.”. Finally, in the 
limit of vanishing c (point 3) the renormalized functions will satisfy the identity 


[as [ox(x) + vx] Til re =0 (11-179) 


and the limit of Eq. (11-170) will read 
v1 Tim me (0) = 0 (11-180) 


This means that the (n — 1) transverse bosons are massless, being the Goldstone 
bosons of the spontaneously broken symmetry. 

Equation (1 1-179)implicitly contains all the relations among Green’s functions 
in the Goldstone phase and implies a series of low-energy theorems. 

The fact that the linear o model and its Goldstone limit verify all the 
constraints of current algebra and PCAC even at the level of Born terms suggests 
the use of it as a phenomenological lagrangian, in the same spirit as the Fermi 
theory for weak interactions. We may in fact find a number of phenomenological 
interactions with this property. The spirit of this approach is to ignore re- 
normalization problems and to make use of the relations arising among processes 
to lowest order. 

An example is provided by the nonlinear o model. The name originates from 
the nonlinear realization of the chiral group on the manifold 


o7(x) + 07(x) = fr (11-181) 
Expressing o in terms of the pion field the resulting lagrangian is written 
‘ 2 
eae | (a: en) (11-182) 


9. Lf2 = rita 
At the classical level we see that the chiral symmetry is realized in the Goldstone 


mode. The composite field o = \/ f? — m* has a nonvanishing value and the pion 
is massless. 


Even though this theory is not renormalizable according to usual criteria in four-dimensional space- 
time, it has aroused great interest for its applications in statistical mechanics where it describes the 
continuous version of the classical Heisenberg model. Moreover, a theory such as (11-182) is re- 
normalizable in dimension two where a scalar field is dimensionless and the Ward identities may be 
shown to constrain the structure of counterterms. It then enjoys a number of features in common 
with the recent field theories of strong interactions. 


11-5 ANOMALIES 


Up to now we have only encountered examples where renormalization did not 
conflict with the symmetries apparent at the classical level. In this section we 
discuss situations of great interest where the opposite is true, and new features are 


550 QUANTUM FIELD THEORY 


uncovered by renormalization. In Chap. 13 similar phenomena will appear when 
dealing with scale invariance. We introduce this subject with an apparent paradox 
in the application of current algebra. 


11-5-1 The x° + 2) Decay and Current Algebra 


When reviewing the successes of current algebra in Sec. 11-3 we carefully omitted 
what seems at first a failure in its application to the 2° decay in two photons 
(Fig. 11-15). Let us now study this process. The amplitude may be written 


F =F (q)| 2am = lim eferT..(q) (11-183) 


qm; 


Tq’) = (q@? — m2)e7e4e3 | d*x d*y (0| Tj,(x)j(y)n(0)|0> ert” (11-184) 


We have exhibited the time-ordered product of two electromagnetic currents 
coupled to the two photons (k;,&, and k2,€2) and of the pion field (x). Here 
gq =k, +k, and the photons are on-shell kj = k3 = 0. In the spirit of PCAC 
(11-117) we are led to replace the neutral pion field by the divergence of the axial 
current, omitting the isospin index 3. Therefore, 


> m2 i ae 
CS aa er efe3q" Tivo (11-185) 
with Typ Se | dbx dty ef ** GO| Tj OPA ONO (11-186) 


To extract the derivative acting on the axial current from the time-ordering 
symbol we have used the fact that the commutator d(x° — y®°)[j,(x), do(y)] 
vanishes for the quantum numbers entering (11-186). Let us now write the most 
general decomposition of the tensor T,,, .(k1, k2). This involves taking into account 
a negative parity from the axial current, a symmetry in the combined exchange 
(ki, u)<>(k2, v) from the Bose statistics of photons, and the transversity to k{ 
and k} as a result of the electromagnetic current conservation 


kt Twp = k> Dip =0 (11-187) 
Ky,€ 


ky,€2 Figure {1-15 1° — 2y decay. 
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With this information we may write the following expression when ki = k3 =0: 
Tugglle Kk) = Envork {kK 34 14(q7) a8 (Siceae kay i Expaaltiina RT hes Tx(q”) 


ay [(Enpackay = Evparkay)kik} ia Env path re k3)ky *k2] T3(q7) (11-188) 
‘ Consequently, 


lias — Euverk{k5q"[ Ti(q’) sr T3(q7)] (11-189) 


We evaluate these amplitudes to leading order in electromagnetism. Since no 
strongly interacting zero-mass particle is around (m2 is considered to be different 
from zero) it follows from (11-189) that 


7 (0)=0 (11-190) 


For the soft pion theory to be valid, this should be interpreted as meaning that 
TJ (m2) is suppressed : 


TJ (m2) ~0 (11-191) 


The dominant part of the 7° > 2y is forbidden according to this observation of 
Sutherland and of Veltman. Fortunately this conclusion turns out to be incorrect 
due to the effects of renormalization. We stress that this does not question the 
validity of the extrapolation from (11-190) to (11-191) but really means that the 
statements (11-189) or (11-190) have to be modified. A method of approaching 
the problem is to perform a perturbative calculation within a given model. The 
interpretation of the result will be that we have to modify the PCAC relation 
(11-117) in the presence of electromagnetic interactions. 


11-5-2 Axial Anomaly in the co Model 


To be specific we use the o model with fermions. For simplicity we keep only 
one Fermi field of charge +1 (the proton) and two mesons m° and o. From 
Sec. 11-4-1 the lagrangian is 


2 2 
m m 
ier peek ayer) 


L=w[id — m+ glo + inys) |W + 3(6n) + 4(00)? — rae ; 
24) +o?) 11-192 
— dva(o +a)— an + o7) ( ) 
To lowest order 
m=—gv mz—mz=2Av* (11-193) 
The proton is the only charged particle to contribute to the current; therefore 


Ju = Wy 
The axial current 
A, = 4Wyyysw + (60,2 — 20,0) + v0, (11-194) 
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len (0 ky,” 


ky.v ki,h 


Figure 11-16 One-loop diagrams for 2° decay. 


has a divergence formally given by 


6A, = m2 f,T j<=—2 (11-195) 
The z°-decay amplitude computed to the lowest one-loop order is (Fig. 11-16) 
Thy = Th (Ks, kz) + Ta (ke, 1) (11-196) 
(1) 2 a He ae ua + m)y(g + B + m)ys(b + m)yv] 
fie (ki, k2) = ge 2 2 
* [lg + p — ki)? = m7] [(g + p)? — m?](p? — m?) 
The trace in the numerator is equal to 4ime,,,,k@k3 (with 9123 = —1) and the 


remaining integral is convergent: 


TY) (k1, k2) = 4ige?meyypck@ks ~ 


d* 1 
i |< [((q+p—ki)? — m7] [(q + p? — m*}(p*? — m?) 


= 4ge*mMEpypok{k3 


] 1 1-x 1 
(47r)2 d iT Le. oS ok SE ca 
x (4x)? i, a8 | dy i = at ee, eee (11-197) 


On-shell we have kj = kj =0, 2k,-k, = q? and we find for the contribution of 
the two diagrams 


74 1 Lo 1 
ii 2 yoo tte ak eks 2 
(q ) Fe EuyvpoE1E2K {KD : dx Fs dy RS = xq? (11 198) 
This is in contradiction to Eq. (11-190) since 7 (0) is not zero, given in fact by 
e2 


F 0) = grr Sanpete SKERS (11-199) 


pee that (11-198) implies a smooth extrapolation in the neighborhood of 
q’ = 0 with 


F (9?) = (: + = 4 ~)70 (11-200) 
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Using g/m = 1/f,, fr ~ 93 MeV, the numerical rate obtained from the leading 
contribution 7 (0) is 


1 | ak, dk; 


Ta5c 2 


| 7 (?(2n)*0(q iis) 


2mz ent, | 4(27)°a1.0@2 
a*m3 


where care has been taken to integrate on a 27 solid angle to comply with Bose 
statistics. The experimental rate 


ae (7.57 + Sev (11-202) 


compares favorably with the above estimate. In fact, as early as 1949, ignoring 
all subtleties of current algebra, Steinberger had derived a similar estimate. 


In the quark model the electromagnetic current is coupled to fractionally charged fermions (charges 
Q;). The axial current reads 


r = th oer 
An => Wires = W; + meson contributions 


The amplitude 7 would be multiplied by a factor x = };15Q?. For a quark triplet (u, d,s), the 
charges are Q; = (3, 3. —4) and the t4 are (1, —1,0); thus x =4. The agreement with experiment is 
lost unless quarks are three times degenerate according to a hidden color quantum number. This is 
in fact one of the most direct evidences of the necessity of such a degeneracy, together with the 
question of statistics in baryonic states. 


We have now to understand what invalidated the Veltman-Sutherland result. 
For this purpose we compute the amplitude T,,,, defined by Eq. (11-186) within 
the same model. From (11-194) the axial current is coupled either to the pion 


with a factor ig,v = —ig,f, or directly to the proton. We have therefore, to 
lowest order, 
> a 1 
Thvp = ley. ed me — (ire (11-203) 


The first two diagrams of Fig. 11-17 contribute to T,,,,; the last two involve 
the pion amplitude T,,. The questionable identity written in (11-185) takes the 
form 


2 2 
Mm, — q 


fm Gein — Lay (11-204) 


- or, alternatively, 


m 


Gls oa Jaliy = g Ty (11-205) 


Of course we want to insure that renormalization preserves the electromagnetic 
current conservation (11-187). Let us compute the first two diagrams of Fig. 11-17. 
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Ki, ky,v Kio 
‘ <a ‘i a < <q 
Ko, kestt ay 
(a) (b) (c) 
ky. 
wef 
ky > u 
(d) 


Figure 11-17 Lowest-order diagrams for the amplitude T,,,,. 


With a factor 3 arising from our definition (11-194), we find 


Tks, ko) = Thrplka, ke) 


a | dep i 
=j— Pr sh, See 5 
1 5} |< (+3 Pompe is a (11-206) 


Contracting with q? yields 


PP key hea) = S| TP, ELBA anal B+ a Hh + list = he omy] 
sea 2 | Qa (p> —m*)[(p +k, —m?] [(p—k2)? — m7] 


(11-207) 


The central term in the trace may be arranged in the form 
(B+ Ki + m)(Ki + Ka)ys(B — Ko + m) 
= (p+ Ki + m)(p + ki —m+ ko + m— p)ys(B — ko + m) 
= Im(p + ki + m)ys(p — Ko + m) + [(p + ki)? — m7] y5(p — Ko + m) 
+ (pb + ky + m)ys[(p — k2)? — m?] 
Thus 
g? Tika, ke) = z Thy (ka, Ka) 


e | d‘p c + m)yus(B— Ko + m)yy (B+ ki + m)yvys( + m)y, 
27 Om: (a = ae) [(p — k2)? — m? | [(p + k,)? - m? \(p? =e) 


(11-208) 
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The first term has been recognized as the amplitude T,{))(k,, k2) occurring in 2° 
decay [Eq. (11-196)]. Hence the validity of (11-205) depends on the remaining 
contribution vanishing. At first sight we would think that each term contributing 
to this integral is zero, as it is a Lorentz pseudotensor depending on one four- 
vector argument. Moreover, we might also argue that a translation of integration 
variable p—p-+k, would show a cancellation between the first term in the 
integrand of g?7,1) and the second in gq’ 7,2). Both arguments are wrong because 
of the linear divergence of these integrals. A safe method is to use a Pauli-Villars 
gauge-invariant regularization. Dimensional regularization is not suited here, 
because of the presence of 75 OF €,+¢, particular to four-dimensional space-time. 
The introduction of a large mass (M) regulator field transforms (11-208) into 


— rane ; M 
PLE) — FOU] = E T3P (om) — = m0 | (11-209) 
¢ 
since the previous arguments are valid for the regulated finite integral appearing 
in (11-208). From the calculation given above we can easily find 
M o 

lim — T{P)(M) = — Eqvpok iki 11-210 

Pine zy (M) an uve 1%2 ( ) 
The result of regularization is to uncover this finite term which represents the 
effect of the second integral in (11-208). Finally, by adding the contribution of 
diagrams (a) and (b) in Fig. 11-17 we find 


Ps m 


Fav = Ta — ~ Exvpok RkS (11-211) 


with an anomalous term on the right-hand side correcting (11-205). Note that the 
identities (11-187) are still preserved. 

We might ask whether it is not possible to eliminate this new term in (11-211) 
using a different subtraction scheme. After all, even though q?Thyp is finite this 
is not the case for T,,,,. which has a linear divergence. The unknown subtraction 
term Tie, must be a linear polynomial in k,, k2, and transform as a Lorentz 
pseudotensor symmetric in the exchange (kj, jt) <> (ka, v). The only candidate has 
the form 


AT vp = constant Eyype(k1 — kz)? (11-212) 


but does not satisfy (11-187). It is possible to find a Ward identity without 
anomalies for the axial current [ie., of the form (11-205)], but the price is to 
lose ordinary gauge invariance. . 

The anomaly which has appeared in (11-211) is welcome since it leads to a 
satisfactory 7° lifetime. Rather than accumulating all evils to preserve the PCAC 
relation, it is more fruitful to correct it to first order in « to read 


od 


04"A3 = f,m2n° — 7a 


uve FF? (11-213) 
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where F,,, is the electromagnetic field strength. No result obtained previously is 
affected by this correction. The impossibility of maintaining for the regularized 
or renormalized theory all the Ward identities implied by the classical approxima- 
tion justifies the name given to these anomalous Ward identities. Let us study 
some of their properties and consequences. 


11-5-3 General Properties 


The phenomenon analyzed in the framework of the o model is common to all 
theories with fermions coupled to axial currents when we try to satisfy simul- 
taneously axial and vector Ward identities. It was discovered as early as 1951 
by Schwinger and was explicitly studied by Adler for electrodynamics and Bell 
and Jackiw for the o model. 

In electrodynamics we want to verify the relation 


0,53 = OnWy"ysh) = 2mipysp = 2mjs (11-214) 


which states that the failure of axial current conservation arises from the fermionic 
mass term. The axial current of Eq. (11-214) differs by a factor 2 from the one 
discussed above. A consequence of (11-214) would be a relation among Green 
functions, analogous to (11-205), of the form (Fig. 11-18) 


q?’Ruvp = ie?q? [a ayer! oes?) CPn (x) jv(y) js .(0)> 


= 2me? [atx dty lrx*"Y Ti (x)j(y)js)> (11-215) 


= 2mR,, 


Let us introduce the proper functions for the axial current and the pseudoscalar 
density: 


ls,(p’, p) = | d*x d*y eX 9" P-*) (Tis (O)W(y)W(x)>r 
(11-216) 
I's(p’, p) = | d*x d*y el? P°) (Tjs(O)W(y)W(x)>r 


Ky ob 


Figure 11-18 Green function with two vector and one axial 
k,,v Currents. 
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where the subscript T means truncation of the fermion propagators. From (11-2 14) 


(Du — PwIS(p', p) = —2imI's(p’, p) + ysS~*(p) + S*(p)ys (11-217) 


According to our usual notations iS(p) is the complete fermion propagator such 
that to lowest order S ‘(p) = p — m. Equation (11-217) is the analog of the Ward 
identity for the vertex function (8-87). 


(Pi — PulA“(P’, p) = S~*(p') — S~*(p) (11-218) 


Equation (11-217) would imply a common multiplicative renormalization of mj, and j£ with re- 
normalization constant Zs; = 1. 


In fact, the identity (11-214) and its consequences, (11-215) and (11-217), are 
not verified perturbatively. They are modified by anomalies arising from the 
triangular diagram, as was the case for the o model. The computation given in 
the previous section can readily be extended to the present case with the result 


2 
GP Ragn — QAR ny. + — CE ppak fk (11-219) 
Tt 


as the corrected version of (11-215) to one-loop order. In operator form the 
anomaly reads 
8, 4x) = 2mjs(x) — = Euvpe FY FP? (11-220) 


Similarly, (11-217) is modified as 


Oo 
get (PD) 


(p, — pwVS(p', p) = —2imI'5(p', pp) + ysS~(p) + S“*(p')ys + Ve 


(11-221) 
F(p’, p) = | d*x d*x! cP’ ¥~ 2°) ¢Tb(y)W(x)Euvpo FF ?(0)>r 


= 


As a consequence of (11-221) evaluated at zero momentum, 2mj; has still a renormalization constant 
equal to unity. The same is no longer true for j#. The breaking of chiral invariance now involves a 
hard component Eyyp¢F*"F°’. 


The structure of anomalies in a renormalizable theory with fermion fields 
endowed with an internal symmetry coupled to vector, axial, scalar, and pseudo- 
scalar fields may be analyzed along the same lines. The general conclusion 
is that only diagrams containing fermion loops coupled to vector and axial 
currents lead to anomalies. Axial currents must occur in odd numbers along the 
loop. Moreover, any loop containing a scalar or pseudoscalar coupling may be 
eliminated by an adequate subtraction. If we insist on preserving the normal Ward 
identities for vector currents, the diagrams of Fig. 11-19 lead to anomalies for 
the axial ones. As in the triangular case, the manipulation performed when trying 
to verify the identity 0,j$ = 2mjs involve divergent integrals up to the pentagon 
diagram. 
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Figure 11-19 One-fermion-loop diagrams leading to anomalies in the axial Ward identities. 


Let 7, and .%, stand for the vector and axial fields considered as matrices 
acting on the internal symmetry indices of fermions in such a way that the 
interaction lagrangian reads 


Lint = wy"(% oF ys A) (11-222) 
The corresponding currents 
Of ew? 
Dh = Sars J§ = —— jle 3 
ov; > OE —— 


are also matrices in the internal symmetry indices. The vector Ward identities 
are by hypothesis the normal ones: 


Cello I(x) (11-224) 


while the axial current anomalies take the form 
a 1 vpo 
0, J §(x) = js(x) — We a pP (AFF yo ar TF iF 25 


+ 31, AP og + FF iv LpVq + iA, FLAS, — 4a 1, SAA) (11-225) 
In these expressions j and j; denote the naive divergences of the currents and 
iy a OF. = Ce, rae iz. saa i Ra S| 


; | (11-226) 
Fay = On, ry Oy A, _ i[ %,,, 4 v1 7 [Vis Ay | 


Neither the structure nor the coefficient appearing in expressions such as (11-213), 
(11-221), or (11-225) are modified by higher-order corrections provided a suitable 
definition is given to the renormalized operators. To see this, let us consider the 
case of electrodynamics, for instance. The argument relies on the existence of a 
regularization of the photon propagator, in the form —4F,,(1 + 017/A*)F*”, for 
instance, such that all diagrams of interest except those with one loop become 
superficially convergent. Indeed, the new power counting gives for a diagram with 
E, external fermion lines, E, photon lines or current insertions and L loops: 
w = 8 — Er — E4 —4L. Therefore amplitudes with L > 2 will superficially con- 
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verge. It is, of course, understood that the standard renormalization of quantum 
electrodynamics has disposed of all internal divergences. This regularization is 
gauge invariant and does not modify the structure of the axial current. Conse- 
quently, the only anomalies are those arising from one-loop subdiagrams which 
we have just analyzed. 

We note that it is possible to define a different axial current with a normal 
Ward identity but violating gauge invariance. In electrodynamics, for instance, 
such would be the case for 


Jus = jus + = buvpo A’F”’ (11-227) 


ow 
2n 
which satisfies 

6,,j4 = 2mjs (11-228) 


These different possibilities can be interpreted in a different language. We could try to construct 
directly the operator j§ in terms of the fields y and w in the presence of an external potential. Since 
combinations such as y(x)~(y) are singular in the limit x — y we separate the arguments by an 
infinitesimal space-like interval ¢, therefore defining 


JP*(x, 6) = U(x + &)y"ysv(x) (11-229) 


This new operator is not gauge invariant. Following Schwinger’s suggestion, this is corrected as 
follows. We multiply (11-229) by a phase factor involving the integral of the vector potential along 
a space-like path from x to x + ¢ along which the various components commute: 


= Te ee exo | -i | : “det A,)| (11-230) 


If we heuristically use the equation of motion, the divergence of this operator can be written as 


ae 


CPO é) = Ami (x + £)ys(x) exp |- | 


x 


dz? 4,(0) — iej4(x, €)F,(x)e"[1 + Ofe)] (11-231) 


The second term on the right-hand side is singular in the limit e+ 0. The vacuum matrix element 
of j# in the presence of an external field has a 1/e behavior from which we recover in the limit 


Ss 


0| 2, 780e)|0> = <0] 2mi(x)»sW(x)|0> — = EnwpeF F(x) (11-232) 


The axial anomaly modifies the PCAC expression in the presence of electro- 
magnetic interactions [Eq. (11-213)]. The added term is a hard one, Siena Ne 
the renormalization properties of the axial current. We may return to n° decay 
and express the amplitude as 


ete T,,,(q2) = (m2 — q?)<y(k1, €1); (ka €2) | (0) |0> (11-233) 


Ne according to (11-213), may also be written 


m2 — 
eke) Tyy(q? = m2 f me 2 c(h y(k2, €2)|0°A3(0)|0> 


+ = Epo pror<P(k1, &1), Y(K2, €2) Dee es >| (11-234) 
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To lowest order in « the second matrix element is equal to (x/m)é{e2€uvpoktk3, 
while the first one vanishes in the limit g? = 0. We therefore reach the conclusion 
that 

Tys(0) = — Exvpok SKE (11-235) 

Th 

in agreement with (11-199). This low-energy theorem is in fact valid to all orders 
in « due to the nonrenormalization property of the anomaly. On the other hand, 
the extrapolation to q2 = m? will depend on the order of the approximation. 

We conclude that anomalies are not an artefact but a true product of re- 
normalization, involving a deeper aspect of field theory. Each time a soluble 
model is available we can check them explicitly as in the two-dimensional 
Schwinger model of electrodynamics with massless fermions which exhibits a 
computable anomaly of the form 


Oyj = CEyyF” (11-236) 


The appearance of anomalous dimensions in the asymptotic behavior (Chap. 13) 
will offer a new insight into these phenomena. 
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CHAPTER 


TWELVE 
NONABELIAN GAUGE FIELDS 


After reviewing their geometrical background we present in detail the quantum 
theory of Yang-Mills fields. The emphasis is on the quantization and the re- 
normalization procedures, both in the symmetric and spontaneously broken 
cases. Vacuum degeneracy and Classical solutions are briefly evoked. Applications 
to the unified theory of weak and electromagnetic interactions are studied in the 
framework of the Weinberg-Salam model. 


12-1 CLASSICAL THEORY 


The generalization of gauge invariance to nonabelian groups was a very appealing 
and natural idea, first proposed by Yang and Mills in 1954. In spite of the 
large amount of work they inspired, these nonabelian gauge theories had a rather 
slow development until the end of the 1960s. The important problems of 
quantization, renormalization, and mass generation were then brilliantly solved. 
It is believed that a theory of this sort can provide a unified description of weak 
and electromagnetic interactions, and possibly also of strong interactions. 

These models offer all possible intricacies. Their quantization and re- 
normalization are difficult, they exhibit remarkable mechanisms of symmetry 
breaking, and they have a unique behavior at short distances, while most aspects 
of their long-range behavior are not yet fully elucidated. It has recently been 
realized that even the classical theory has a fascinating complexity. 

It is almost impossible to present such a wealth of information within a few 
dozens of pages. We shall restrict ourselves to the basic topics and only mention 
some of the possible developments and problems. 
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12-1-1 The Gauge Field A, and the Tensor F,,, 


The model considered by Yang and Mills was based on the isotopic symmetry, 
with the group SU(2) of global invariance. Was it possible to transform this 
invariance into a local one, so that the reference frame used to define the isospins 
could vary from point to point? If so, the information that a particle produced at 
a given space-time point was a proton, say, was meaningless for a different 
observer, unless there existed a way to compare their two frames. This is the role 
of the gauge field, very much as in electrodynamics relative phases of charged 
fields at different points make sense only when compared via the electromagnetic 
potential. 

To be more explicit, let us consider N fields (Lorentz scalars for simplicity) 
transforming according to an irreducible representation U of some compact Lie 


group G: 

p(x) — (°p)(x) = U(g) P(x) (12-1) 
where U(g) isan N x N unitary (or orthogonal) matrix. We assume the lagrangian 
invariant under the transformation (12-1). We want to build a more complex 
theory, still invariant when g depends on the space-time point x. Let g(x) be such 
a G-valued function. The derivatives 0,@ have no longer any intrinsic meaning 
as we compare fields (x + 6x) and (x) that transform independently under 
(12-1). It is necessary to introduce a new object to allow this comparison. 

Consider an infinitesimal transformation 


g=e-+ oa,t? (12-2) 
where e is the identity of G and the f* are elements of the Lie algebra satisfying 
the commutation relations 

ane eet (12-3) 


In the representation associated to the fields ¢, antihermitian matrices T° represent 
the elements f? of the Lie algebra, so that under the transformation g of Eq. (12-2), 


(x) + (°b)(x) = (x) + 54(x) 
O(x) = ba,T°(x) = dap 
If the infinitesimal parameters 6«,(x) depend on x, these transformations will be 


called (local) gauge transformations, and the transformation group—the gauge 
group—will formally be the infinite product ees 


5h(x) = 5a(x) G(x) = Seta(x) T*O(X) (12-45) 


We introduce the gauge fields x > A“,(x). They are vector fields and carry an 
index of the adjoint representation of G. We denote A“ the corresponding element 
of the Lie algebra: 


(12-4a) 


A"(x) = A®,(x)t? (12-5) 


and we will use the same notation for any of its representations, A",(x)T*. What 
is understood should be clear from the context. 
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To an infinitesimal path (x, x + dx), the gauge field associates an element in 
the group 
g(x + dx, x; A) = e + dx*A,(x) (12-6) 


providing the means to compare two neighboring frames. This generalizes to a 
finite path C going from x, to x2. If s > x(s), 0<s <1 is a parametrization of 
this path: s(0) = x;, s(1) = x2, the element of G associated to C 1s 


1 
g(C; A) = Pexp (| is.) 
0 ds 


(oe) 1 So S Zi ; 
= y|) ds, \, GS,=1 _f ds, jg Alon] el (12-7) 


The path-ordering operation P is the analog of the familiar time ordering T. 

In the language of differential geometry, the A” form a connection and define 
a parallel displacement of geometrical objects belonging to representation spaces 
of the group. A parallel displacement of ¢ from x to x + dx is defined through 


dix) = G(x) = G(x) + dx A(x) G(x) (12-8) 
The definition of the covariant derivative follows naturally: 


dx" Dyp(x) = O(x + dx) — (x) = dx"[0, — Ay(x)]O(x) 


(12-9a) 
D, = 0, — A,{(x) 
or, in components, 
(Dy)a® = 8,64" — (Ta? Auc(x) (12-9b) 
In particular, in the adjoint representation (T‘),° = C®, = —C*,, 
(Dy)a” = Onda" + C”aAue(X) (12-9¢) 


If G is the group U(1), this reduces to the familiar concept in electrodynamics. 
The gauge field is endowed with a transformation law under gauge transformations 
such that ¢,(x) transforms as ¢(x + dx). To this end, it is sufficient that 


g(x + dx, x; A + 6A) = g(x + dx)g(x + dx, x; A)g™ '(x) (12-10) 


Hence 
(x) > U[g(x + dx, x; A + 6A)]U[g(x)]6(x) 
= Ulatx + dx)] 00) pny 
as desired. In infiriitesimal form, this yields 
6D,O(x) = da(x)D,.b(x) (12-12) 


which means that D, (x) transforms as $(x). Expanding (12-10) to first order in 
6a results in 


5A, = 6,50(x) + [5a, A,] = D,Sa(x) (12-132) 
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where 6« is considered as transforming under the adjoint representation. More 
explicitly, 


SA ya(x) = O,5ctq(x) + C500,(x) Aye(x) (12-135) 
For a finite gauge transformation, Eq. (12-10) gives 
Aux) > 9Ay(x) = [8,90)]97 0) + 90) Aug 100) (12-14) 


As compared to the abelian case, a new feature has emerged. For a constant 
g (or oa), A, transforms as a charged field belonging to the adjoint representation, 
as indicated by the second term on the right-hand sides of Eqs. (12-13) and (12-14). 

In classical electrodynamics, two potentials that only differ, locally, by a non- 
singular gauge transformation are physically equivalent, and are then characterized 
by the same field strength tensor F,,. We want to construct the analogous 
curvature tensor F,, in the nonabelian case. To this end, we consider the parallel 
displacement of some representative ¢ along an infinitesimal closed path C. After 
returning to the starting point, ¢ has rotated by 


giG- A) — P exp | dx Acc 
(@ 


If | is the dimension of C, we expand this element up to order /’: 


ci aAy=e+ | ix-a+| | dx2+* A(x) dx, ° A(x) + O(I) 
c Cue 


x > X] 


where, as before, the ordering implies that the curve has been parametrized with 
Xo = x(0) = x(1). We also expand A,[x(s)]: 


A,lx(s)] = Au(xo) + [x"*(s) — x00, A,(%0) + °° 
and get 


2 
where .% is the infinitesimal area bounded by C. We set 
F,,, = 0,A, — 0,Ay — [Ap Av] 
= (6, Ava — OyAya — C™aAppAve)t* (12-16) 


g(C; A) =e + all dx" 0 dx"(0,Ay — vA, — [Ap, Av]) + O(F) (12-15) 
Rod 


thereby generalizing the definition of electric and magnetic field. This tensor is 
often referred to as the strength tensor. A useful identity follows from the previous 


derivation: 
[Dis D,| = ay (12-17) 


In a gauge transformation, g(C ; A) transforms as 
g(C; A) > g(xo)g(C; Ag” *(Xo) 
[cf. Eq. (12-10)]. Hence, F transforms as a charged field belonging to the adjoint 
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representation 
F(x) > g(x)F(x)g" "(x) (12-18) 
or in infinitesimal form 
FoF +06F 
5F = [5a(x), F] (12-19) 


OF atx) = C sadagieeh ayer) 


In particular, the covariant derivative of F reads 


Dia Ft TD aE ,,| 80, — (Ae | 


In electromagnetism, the exterior derivative of the differential form 4,(\)dx". viz OE gl OURO RA 
closed form of degree two: 


6,F,, + cyclic permutations = 0 (12-20) 


This property is equivalent to the homogeneous Maxwell equations div B= 0, curlE + €B cr — 0. 
Reciprocally, given such a closed form, Poincaré’s lemma entails the local existence of a potential A 
from which F is derived (Sec. 1-1-2). The situation is slightly different in the nonabelian case. There 
does exist an analogous identity 


[Dus Pop] + [Dy Fou] + [Dp Fur] = 0 (12-21) 


as a consequence of Jacobi’s identity and Eq. (12-17). Notice, however. that Eq. {12-21) assumes the 
existence of A, as it appears in the covariant derivative. Moreover, one can show that if F,, and 4, 
satisfy (12-21), F is not necessarily the strength tensor associated with A. Accordingly, in contrast 
with the abelian case, the field strength tensor F does not determine uniquely all gauge-invariant 
quantities. 


If F,,, vanishes in the neighborhood of a point, A, is a pure gauge 
Fyy = 04> 3g(x): Au) = [8,900] 97 208) (12-22) 


Indeed, if F = 0, the integral of A, along a path C from the origin to x does not 
depend on the curve 


g(x) = g(x,0; A) = Pexp fl ax- Ab | 
Cc 


by definition of F. This element g(x) satisfies (12-22). Conversely, a pure gauge 
A, has obviously a vanishing F. 

Owing to the gauge arbitrariness, we may sometimes demand that the potential 
locally satisfy a definite condition. This is referred to as a choice of gauge. For 
instance, let n“ be a fixed four-vector. There exists a gauge transformation, A > 4’, 
such that 


n-A(x)=0 (12-23) 
This is the so-called axial gauge. 
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To show that this is possible, let us introduce a four-vector N# such that N-n + 0 (for instance, 
N = n, if n? # 0). Every point x may be written in a unique way: 


x= A(x)n+ x, xy = 0 


with es eal xX, =xX-—n 
rh Soa! 
Consider the segment C, x(s) = sA(x)n + x (0 < s <1), and the integral 


ax a) 
g(x) = Pexp {| ds Pe Alsi} = Pexp | dan: A(An + | 
0 


0 


Introducing 


Aj, = (6x9~")g + 97 * Aug 
we observe that 
ne d= (n-cg™'\g+ go 'n- Ag=g \n-A—n-a)g 
From the definition of g(x), we have 


=| 


I” = n-0g(x) = n- A(x)g(x) 


which implies n- A’ = 0. 


Similarly, we may show that through a gauge transformation we can always 
fulfil locally the Lorentz gauge condition 


0-A=0 (12-24) 


or any condition obtained from (12-23) or (12-24) by replacing the right-hand side 
by a function taking its value in the Lie algebra. 


12-1-2 Classical Dynamics 


Our aim is now to define a gauge-invariant action. As far as the coupling to the 
various multiplets of charged fields (matter fields) is concerned, it suffices to use 
the minimal coupling prescription. We substitute everywhere the covariant 
derivative for the ordinary one: 


ae Dita A, (12-25) 


where A, = A,,7T“ is understood to act in the representation of the matter field. 
The part of the action depending on A only must be a Lorentz scalar, gauge- 
invariant quantity—at most quadratic in the derivatives. The only candidate is the 
trace of F,,,F%” in some irreducible representation. For a simple Lie group traces 
in different irreducible representations are proportional since there exists a single 
quadratic invariant in the Lie algebra. We pick, for instance, the fundamental 
representation (of smallest dimension) and normalize its generators according to 


tr (t*t”) = —46” (12-26) 


where the minus sign is related to our convention that the t are antihermitian. For 
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instance, in SU(2) and SU(3), we take respectively 


ig 
—_— = iF 2 5 
5 a 
iA? 
d t= — = |e 
an 5 a 
The action pertaining to A reads 
1 4 m I 4 u 
= — y= — 5) d x leet (12-27) 
: ae [a i am | ~ 


The dimensionless parameter g, not to be confused with a group element, 
plays the role of a coupling constant. This is evident after rescaling A into 


A-gA 


jas ‘ | d*x tr (FyyF*”) 


1 
where F,, = 0,A, — 0,A, — g| Ay, Av| = ——[D,, Dy 
ue Be ue gl u | =e B | (12-28) 


Dy = On — GAn 


For the sake of brevity, however, we will not perform this rescaling until we 
derive the Feynman rules. 

For a general invariance group the Lie algebra is the direct sum of simple Lie 
algebras, plus abelian factors. To each of these terms a quadratic invariant and 
an independent coupling constant may be associated. An example is provided by 
the Weinberg-Salam model for the weak and electromagnetic interactions, based 
on the group SU(2) x U(1), with two coupling constants related to the Fermi 
coupling and the electric charge (see Sec. 12-6 below). 

The classical equations of motion are readily derived from the stationarity 
condition of the action (12-27): 


2 
0= 6] = ra | d*x tr [5A’(O"F,, _ [ A’, F,,y|)] 
Hence 
De. Fil = Clb, _ [Ae | es =) 
, (12-29) 
or Cig es Fuyp =0 


which provides a nonabelian generalization of the Maxwell equations. The non- 
linear character of the equations (12-29) makes their resolution nontrivial. 


These equations possess the desired covariance property. If A, is a solution, so are its gauge 
transforms. Also, it must be clear that the system (12-29) is a compatible system. In particular, the 
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contraction with 0, gives zero: 
OO"F,, = O° A", Fy] 
= [0"A", Fyy] + [A*, °F] 
= (0°A", Fv) + [44.04% Fed] 
= $[0"A* — 6A” —[ A’, A*], Fy] = 0 


a g = 1 the canonical energy momentum tensor is [compare with Eq. 
(1-105) 


~ 


©” = 2 tr (F"°0"A, — 4g’ F°F,,) 


but is not gauge invariant. This may be cured by subtraction of a total derivative 
7h aaa 


AG” = 2 tr [F#(0,A” + [A’, A,])] = 20, tr (F*?A”) 


where use has been made of the equation of motion (12-29): 
©” = 6” — AO” = 2 tr (F"°F’, — 4g""F °F.) 
= ) (PEF 2 kg" FF, oa) (12-30) 


We introduce the analogs of the electric and magnetic fields 
Ber, Big = —46in.F*, tek ato. 3 (12-31) 


where, as throughout this chapter, the indices i, j, k are space indices (i, j, k = 1, 2, 3), 
while a, b, c are indices of the Lie algebra. In terms of E and B, 


1 
©° = 5 > (Ea* Ea + B,° B,) = —tr (E? + B?) 
a (12-32) 
@% = ) (E, x B,)' = —2 tr (E x B)! 


12-1-3 Euclidean Solutions to the Classical Equations of Motion 


The search for classical solutions is motivated by the belief that a semiclassical approach may shed 
some light on the underlying quantum world and that classical configurations of fields that make the 
action stationary play an important role. 

Especially interesting are nondissipative configurations with a finite energy, i.e., such that their 
energy remains localized in a finite spatial region and is not radiated to infinity. Such objects are 
candidates to describe extended systems at the quantum level. These are coherent states of the 
fundamental fields, provided they are stable against decay. Stability may follow from some conservation 
law, perhaps of topological nature. These systems have received the name of solitons or energy 
lumps. As they arise from an expansion about a nontrivial stationary point of the action, these lumps 
and their quantum excitations exhibit features that could not have been suspected from the ordinary 
perturbative expansion. We will examine in Sec. 12-5-3 an example of a four-dimensional gauge 
theory involving scalar fields, which possesses finite energy solutions. On the other hand, it is possible 
to show that nontrivial finite-energy nondissipative solutions do not exist in nonabelian gauge 
theories with gauge fields only. In other words, any solution of this sort is equivalent to A* = 0. 
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Consider the Lagrange function for a time-independent solution, i.c., the space integral of the 
lagrangian 


1 
babylon | x (Ba B.— BB 


with the notation of Eq. (12-31). The total energy H is the sum L, + L2; hence its finiteness implies 
the finiteness of L,;, Lz, and L. A solution if it exists is unstable under the scale transformations 


Aa(x) > pAAg(Ax) 
Al(x) > LAL(AX) 
The Lagrange function is transformed into 
L— p?aL, — AL 
but it should be stationary at p = A = 1. This implies 
L,=1L,=0 
Hence 


F =0 


and, by a global extension of the local statement of Eq. (12-22), this means that the gauge field is a 
pure gauge everywhere. This scaling argument (due to Coleman) may be shown to prevent such time- 
independent solutions in any space dimension different from four. It has been extended to more 
general nondissipative configurations. 

There exist, however, nontrivial four-dimensional euclidean solutions to the classical equations. 
Before explaining the nature and role of these euclidean solutions, let us analyze further the structure 
of the ground state in nonabelian gauge theories. 

To carry out this analysis it is convenient to impose the gauge condition Ay = 0. Classically the 
ground state must correspond to time-independent field configurations of vanishing energy density. 

We have therefore F,,, = 0, which means that the field A is a pure gauge 


A(x) = O[g(x)]97 *(x) (12-33) 


Furthermore, we assume that we may restrict ourselves to transformation functions g(x) that have 
the same limit in all spatial directions. We may take this limit to be the identity in the group 


g(x) ——e (12-34) 

|x|—-00 
(there is actually no very convincing argument to justify this restriction). Under these circumstances, 
all the field configurations of the form (12-33) and (12-34) may be regarded as describing a ground 
state. We may wonder whether all these copies of the vacuum are equivalent, i.e., whether there exists 
a continuous gauge transformation vanishing at spatial infinity that connects any two of them. The 


surprising answer is generally “no.” Suppose for definiteness that the gauge group is SU(2). Any matrix 
of SU(2) may be parametrized in the form 


g(x) = up + iu-o (12-35) 


in terms of the Pauli matrices, and with w real satisfying u3 + u? = 1. Hence, SU(2) is isomorphic to 
the three-dimensional sphere S3:u5 + uj + ug +u3=1. On the other hand, the whole three- 
dimensional space with all points at infinity identified is also topologically equivalent to $3. Therefore, 
the gauge transformation g(x) associated with each vacuum is a mapping from S;3 onto S3. According 
to homotopy theory, such mappings fall into equivalence classes. Two mappings x — g;(x) and 
X + g2(x) belong to the same class if there exists a continuous deformation from g;(x) to g2(x). In 
the case at hand, the classes are labeled by a positive or negative integer called the winding number 
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or Pontryagin index of the class. This integer characterizes the number of times S; is mapped onto 
itself. It is equal to 


‘ 
n=>5 | d>x det {Ak(x)} (12-36) 


with iA%(x)o given by Eq. (12-33). Examples of representatives of the class n = 1 are 


o°x x? — 1 
gi(x) = —exp (i 2.) or gis) = + 2) 
vee x +1 
and g,(x) = 91"(x) belongs to the nth class. In the case of other simple groups, the same conclusion 
holds, namely, that there exists a discrete set of inequivalent vacua |n> labeled by an integer n. 

As explained in Chap. 11, such a degeneracy of the ground state is intolerable and is actually 
cured by quantum tunneling. The true vacuum is a linear superposition of the degenerate approximate 
vacua |n). Since the above gauge transformation shifts the winding number n by one unit, and since 
the true vacuum must be invariant—up to a phase — under any gauge transformation, it has the form 


o-x 
se th | 


|0> = 3 e** In» (12-37) 


where @ is a new arbitrary (and unexpected!) parameter in the theory. 
We may then try to understand better how tunneling takes place between the degenerate vacua 
|n>. To this end, we recall the Feynman-Kac formula [Eq. (9-198)] 


<nz|e7 7" |ny> = | QG(A)e~? (12-38) 


where H is the hamiltonian of the system and J is the euclidean action between times 0 and T: 


1 T 
=| dt | dx Fr", 
49° Jo 
1 


T, 
=— | dt | d°x(E.°E, + B.°B,) (12-39) 
29° Jo 

We have extended the definition of the strength tensor in euclidean variables. The functional integral 
in (12-38) is restricted by the boundary conditions on the winding number n(A): 


f n[ A(x, t = 0)] = my (12-40) 
n[ A(x, t = T)] = n2 


We are slightly cheating here, since the measure (A) has not yet been properly defined. Its precise 
definition will be clarified in the quantization procedure of Sec. 12-2. We see now an alternate 
explanation of the classification of ground states according to homotopy classes of $3 + $3. For T> o, 
we look at those configurations the neighborhood of which yields a finite contribution to the Feynman- 
Kac formula. Since their euclidean action is finite, F,,, must vanish at infinity in all euclidean directions, 
which amounts to saying that A, is a pure gauge field and yields a mapping from S3, the surface at 
infinity in a four-dimensional euclidean space, onto the group, for example, SU(2) ~ S3. Moreover, it 
may be shown that the winding number n associated with this mapping may be written as the 
euclidean integral 


1 
n= ——\ | d*x etPoFuy, Fee, = 


~ 64x? 32n? 


| do EEGEe”, (12-41) 
where the dual tensor is F4” = 46""°°F°*,. We observe that 
gr rien Se 


K* = 287A" (0° A", ~ S8abc A”, Az) 


(12-42) 
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For very large T, we expect the integral (12-38) to be dominated by the neighborhood of 


5) 


stationary configurations which are solutions to the euclidean classical equation of motion 
Dy F¥, = 0 (12-43) 


satisfying the boundary condition (12-40) or, equivalently, such that 


1 po 
Sine n= | da ere, (12-44) 
n nz ny 3272 | x 


Such a solution has an action bounded from below in terms of n. This comes from the positivity of 


0< [ats (FY, + F#”,)? = 2 | ats fla F*Y, F*] (12-45) 
1 4 wy \2 1 4 vy Puy 8n* | n| 
= ee aoe eve nee (12-46 
whence I ag? {4 x (F*,) ae x a ) 
The inequality is saturated by the self-dual or anti-self-dual configurations F = +F, since the 


equations of motion are then automatically satisfied as a consequence of the identity (12-21). 
Explicit solutions of arbitrary winding number have been recently discovered. The solution 
proposed by Belavin, Polyakov, Schwartz, and Tyupkin for nm = +1 reads 


2 


x -1 
A, = Fak {8,9(x)]g~ *(x) (12-47) 


A gists Oe 
with ~ g(x) = “ey 


Clearly, at infinity A, reduces to a pure gauge, arising from the identity mapping from S$; onto $3. 
We thus expect and may check by a direct computation that this solution has n= +1. More 
generally, if we use the parametrization 


Au => (Ea: Op + 5ai00) In f L k= 1, oa 


(12-48) 
A®, = te, In f 


the equation F = +F reduces to 


1 
j Orf=0 Os=0§+V (12-49) 


The case n = +1 corresponds to f") = 1 + 4*/x* but other solutions corresponding to a winding 
number -tn may be devised: 


n+1 43 
i 


f'~= y —; (12-50) 


i=1 (x— xi)? 


They depend on n + 1 arbitrary scales 4; and positions x;. Such solutions have been called pseudo- 
particles (by reference to the imaginary time coordinate) or instantons (due to their local character 
in time, as compared to the three-dimensional solitons). It is known that further solutions must exist. 
For SU(2), for instance, the general solution depends on 8n — 3 parameters up to gauge transformation. 
We will not dwell on this rapidly evolving problem, nor on many other related topics such as the 
modifications to be brought to this picture in the presence of massless fermions. 

Although fascinating, these global properties will be omitted in the following. This is legitimate 
because we shall concentrate on the perturbative expansion, which is insensitive to the choice of the 
vacuum we expand about. 
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12-1-4 Gauge Invariance and Constraints 


The equations of motion are insufficient to determine the fields A,(x), given a set 
of Cauchy conditions at a time to. Two solutions obtained from each other through 
a gauge transformation g(x) such that g(x) = e for t < fo satisfy the same Cauchy 
~ conditions but may differ for t > to. Therefore, the gauge arbitrariness has to be 
restricted through the introduction of auxiliary conditions which do not affect 
gauge-invariant physical observables. The following discussion, due to Faddeev, 
will lead us to the functional quantization outlined in Chap. 9. 
We restrict ourselves to a simple compact Lie group. The action, as in the 
abelian case, is first rewritten in terms of independent variables F and A: 


1 
Tee [ets tr [(@yAy — @yAy — [Ags Av) FY’ —4 FF] (12-51) 
z 


By variation with respect to F and A, we obtain respectively (12-16) and (12-29). 
We introduce the notations E and B of Eq. (12-31) and make use of the time- 
independent relation between A and B to eliminate the latter. In the sequel, B will 
stand for B(A). We have 


bs 
je = | d*x tr [(€°A + VA° — [A°, A]):E + 3(E? + B’)] 
Fe 


. 
nn | d*x tr [@°A +E + 4(E? + B?) — A(V-E+[A,E])] (12-52) 
g- 


after an integration by parts. Since 4(E* + B’) is the energy density, we face a 
typical problem of constrained systems. The canonical variables p and q are here 
A/g and E/g. The variables A®° play the role of Lagrange multipliers for the 
constraints 


T(x) = V-E+[A,E] =0 (12-53) 


which are the v = 0 components of the equations of motion (12-29). Owing to the 
diagonal metric (12-26) (which enables us to have completely antisymmetric 
structure constants Cyr), We write the equal-time Poisson brackets as 


{ Ai(x), Ei,(y)} xo=Yo — g? 545 95° (x a y) (12-54) 
We also need the brackets {H, I} and {I, } where 
i 53 | d°x ) (E2 + B?) (12-55) 
g a 
Therefore 
{E,(x), F(y)}x0=50 = g?[TE(o), t7]5°(x — y) 
{A,(x), T(y)} x02 yo = a 1Ds t*d°(x me y)| 


where D, stands for V, + A(y), and the commutation is understood for the 
derivatives and for the Lie algebra as well. On the other hand, ina time-independent 


(12-56) 
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infinitesimal gauge transformation, E and A transform as 
OE, (x) = CabcO0p(X)E-(x) 


= =( i. d?y {Bao sash iTa0) 
6Aq(x) = — Voota(x) + Caved%o(x)Ac(x) 
I 


az ge | d*y jal 3 soa itao| 


Hence, the I are the infinitesimal generators of time-independent gauge trans- 
formations in this hamiltonian formalism. Without any further calculation, we 
conclude that 


{Te a6 = g? Copelie(porx — ) 


(12-57) 
{H, T,(x)} = 0 
The equations of motion read 
OE! = (curl B)' + ¢[ A’, B*] + [A°, E' 
oF! = (curl BY! + ep[/, BY} + (4°, E' ott 
@oA = —E— VA° +[A°, A] 
and the constraints . 
V-E+[A,E]=0 
(12-59) 


B' = (curl A)! + 46;,[ A, A*] 


The observables are functionals of E and A, restricted to the manifold (12-59), 
such that their Poisson brackets with the latter vanish on that manifold. As a 
consequence they are invariant under time-independent gauge transformations. 
Such is the case of the hamiltonian density, for instance. 


12-2 QUANTIZATION OF GAUGE FIELDS 


The next step is to quantize the theory and to derive the perturbative rules for 
the computation of Green functions. The physical interpretation, which was a 
valuable guide in the case of electrodynamics, is missing here. In particular, we 
do not know whether asymptotic massless states, similar to photons, will emerge 
from the quantization. 


12-2-1 Constrained Quantization 


The method of Sec. 9-3 is basically hamiltonian, and dynamical variables have 
been chosen in a noncovariant way. Nevertheless, we hope to end up with covariant 
expressions. 
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The r constraint equations 
lP=V-E+[A,E]=D-E=0 (12-60) 


(ris the order of the Lie group) have vanishing Poisson brackets with H or among 
themselves, on the manifold (12-60), as prescribed in (9-152) and (9-153). Two 
pairs (A, E) satisfying (12-60) are equivalent if they lie on the same trajectory of 
the flow (9-142): 

dA dE 

eS {T, A} — = {IE} 
that is, if they are obtained from each other through a time-independent gauge 
transformation. We then have to introduce an auxiliary condition which selects a 
unique representative in each equivalence class. 

It is always possible to perform a gauge transformation such that everywhere 


A? =0 (12-61) 


[see Eq. (12-23)]. These r auxiliary constraints of the axial gauge define a unique 
element on each trajectory, if we assume that the fields vanish fast enough at 
spatial infinity. Since gauge transformations which preserve the condition A? = 0 
are independent of the variable x? and must reduce to the identity at infinity, 
they do so throughout space. This condition also prevents from performing any 
global transformation. 

We have next to compute the Poisson brackets between the I’ and the 
auxiliary conditions. The advantage of the axial gauge (12-61) is that these brackets 
do not depend on the dynamical variables E and A. Indeed, {T, A*} is an 
infinitesimal gauge transformation of A®, but restricted to A* = 0, and does not 
depend on A or E. The determinant of this bracket that appears in the path 
integral (9-159) may be absorbed in the normalization, and will not be written in 
what follows. The generating functional of the Green functions reads 


e610 = | OE, A, ATT 5A 
x exp e | d*x tr [C°A- E + 3(E? + B?) — A°D-E— gA A} (12-62) 
g 


We have added a source term for the four components of A“, although A° 
vanishes and A°® may be integrated explicitly. 

The introduction of a source term enables us to question the system. Some 
answers such as the Green functions may depend on the choice of gauge and are 
just an intermediary step to get physical information. It would be tempting, of 
course, to consider directly gauge-invariant quantities such as the S matrix. 
Unfortunately, the asymptotic states are unknown and any computation of the 
would-be S-matrix elements is plagued with severe—though interesting —infrared 
divergences. Moreover, the theory requires renormalization, and an algorithm for 
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eliminating ultraviolet divergences which does not rely on the Green functions 
remains to be found. 

Since E appears only quadratically in the exponent of (12-62), the gaussian 
integration may be carried out: 


eG) — | au [| 64°) exp} 5 {a (ey — 29% as) 
x Y 


L= ttt (FY Fy) 


(12-63) 


The subscript A of G4(J) stresses the fact that Green functions depend on the 
choice of gauge. In spite of the simplicity of this axial gauge, the Feynman rules 
that we may derive are not Lorentz covariant. It is therefore natural to consider 
more general conditions. We shal] proceed in several steps. 

We first study another noncovariant gauge, the Coulomb gauge, defined by 
the auxiliary condition 


div A= 0 (12-64) 


As stated above, it is always possible to find a local gauge transformation so as to satisfy (12-64). 
This condition was generally considered as determining uniquely the gauge transformation g. Phrased 
differently, if div A = 0, it was traditionally stated that the solution in g of div (°A) = 0 reduces to 
the identity under suitable conditions at spatial] infinity. This is what happens in the abelian case. If 
div A = 0, div A’ = O with A’ = A + WA, the harmonic function A vanishes at infinity and hence every- 
where. However, as pointed out recently by Gribov, this is no longer true in the nonabelian case. The 
equation for g involves A and admits solutions for A large enough. Given A such that div A = 0 let 
us look for a time-independent infinitesimal gauge transformation 4" = 4' + [éx, A‘] + ¢‘éx such that 
div A’ = 0: 


0(0'6a + [da, A}) = 0 (12-65) 
For A small enough (equivalently, for small enough coupling constant) we may expand 
ba = dol!) + g5olt! + --- 
and Adof™ = 0 Adal") — 8;[5o8, A™] = 0 etc. 


If ~ is assumed to vanish at infinity, so do all the x“). There is no nontrivial solution. However, 
Eq. (12-65) may be regarded as a Schrédinger equation. For a potential A large enough, it can be 
checked that bound states do exist, i.e., solutions of 


Aa + O;[ a, A;] = Jie 


with E <0. Therefore, for intermediate magnitudes of A, there must exist zero-energy solutions of 
fast decrease at infinity. 

This objection seems embarrassing for our program, since the comparison of the quantization 
in different gauges relies on the assumption of uniqueness of the transformation which relates them. 
However, since this phenomenon appears for large values of the field, it does not affect the 
construction of the perturbative series which is by essence a small-field (small-fiuctuation) expansion 
about a given classical configuration. We shall henceforth discard this Gribov phenomenon. Whenever 
we talk of uniqueness of the choice of gauge (12-64), we understand it in a perturbative sense. 


The auxiliary conditions (12-64) have a nontrivial Poisson bracket with the 
constraint 
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{div A,, F,()}.,=y» = ee div A’,(x) 
dae(¥) 
= [~SapAx + CareVs* Ac(x)]6°(K—y) (12-66) 
We introduce the operator ./: 
May(X. ¥) = [—Axdap + Cave Vx * Ac(x)]°(x — y) 
= [—Axdap + CareAc(x) * Vi16°(x — y) (12-67) 


where the last expression holds when ./ is restricted to the constraint manifold. 
In the Coulomb gauge, we thus write the generating functional of Green functions 
as 


eG) — | Gd) |] et MV: a] exp {os [ats [LY —2gtr(J- 4} (12-68) 
Through a modification of the normalization in Eq. (12-68), we may replace 
det .@ by det “&.M5' where 

Mo = — A6;;5°(x — y) 


Using det 4M. * = exp [tr In (.@.W¢')] and tr In(1 + A) = ¥ [(—1)""*/n] tr A" 
we may derive the Feynman rules in this gauge. However, they are not covariant 
either. 


12-2-2 Integration over the Gauge Group 


The gauge transformations used so far were time independent. In order to make 
the Lorentz covariance manifest, it is more appropriate to consider time-dependent 
transformations as well. The method to implement them will appear as a by- 
product of a different problem. How can we show the equivalence between the 
axial and Coulomb gauges? 

Let us drop the coupling to the external source J, replace it possibly by 
sources coupled to gauge-invariant quantities O,(x), and consider the functional 
integral (at a given time) 


aes | F(A, E)T] 5(A3)u(A, E) (12-69) 


where p is the gauge-invariant functional 
pA, E) exp ‘i | d*x [ L(x) + Jeon 


In a gauge transformation 
A~>(A)= —(Vg)g"'+gAg* E> (°E)= gEg™' (12-70) 


where g is considered as a product | |, g(x) and y is invariant. So is the measure 
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Y(A, E), since the gauge transformations are canonical. Hence 
), | QA, E) I] 63(°A>)u(A, E) 


As the conditions V- A = 0 or A? = 0 are two equivalent ways to define uniquely 
(in a perturbative sense) a point in each equivalence class, there must exist a 
go(A) such that 


oA7-=0=>V-A=0 (12-71) 
Let us compute the jacobian of this transformation. We set 
A (A) = | Bg) | | 6(V- 2A) (12-72) 
where Y(g) stands for the infinite product of invariant measures on compact 
groups, isomorphic to the group G at every space point x, Y(g) = Tle Dg(x). 
The invariance of the measure Dg’ = D(gq’) entails 
A(9A) = A(A) (12-73) 
We then multiply Eq. (12-69) by 


1 =A(A) | R(g) I] 0(V- 9A) (12-74) 
interchange the order of integrations, and use Eq. (12-73) to write 


X= | D(A, E)u(A, E) T] 6[.43(x)] | Hg) {LV -?AQ)] AA) 


= | (A, E)u(A, E)A(A) | | O[V- AQ)] \ Zig) fof? A2(x)] (12-75) 


Owing to the invariance of the measure “(g), we may change in the last integral 
g ' into ygo, where go is such that %A* = 0. For simplicity, we denote B= %A 
and write 


| P(g) [  d[% A(x)] = | Bg) |] 6[9B7(x)] 


Since B* = 0, it suffices (at least within perturbation theory) to consider only 
infinitesimal transformations g and to perform the group integration in the 
vicinity of the identity 


g(x) = e + a(x) 
where a is infinitesimal. Under these circumstances, the measure at each point 
Dg(x) reduces to the product | |; da,;(x) and 
Oa(x) 


2B (x) =n fe: 
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Hence the integral 


[oom [one 


is independent of A and may be absorbed in the normalization. Therefore with 
an overall y-independent normalization factor N, we have 


en | D(A, E)A(A)u(A, E) [] o[V- A(x] (12-76) 


To recover (12-68), we have to show that A(A) is proportional to the determinant 
of the operator . # defined by (12-67). Since A(A) is multiplied by 6(V- A) in (12-76), 
it is sufficient to compute it for transverse fields A. Then, only aaRTeNele g 
contribute to the integral defining A: 


A-A) = | (9) TL OL¥- °A(x)] 


_ | 2@ I] 6(V + {—Voa(x) + [o(x),A(x)]}) 


= det”! a (12-77) 
with, as in (12-67), 


Mar(Xx, y)= =—[V- “Ad(x)]| 


a 


We have just shown the equivalence of the axial and Coulomb gauges, as 
far as the computation of gauge-invariant quantities (A, E) is concerned. What 
happens to the source term, and hence to the Green functions, has not been 
examined, However, only local and canonical changes of field variables have been 
performed in the previous derivation. By virtue of the equivalence theorem 
mentioned in Chap. 9, we do not expect these transformations to modify the 
physical content of the theory. 

The previous method is interesting, as it allows us to deal with time-dependent 
gauge transformations and to impose covariant auxiliary conditions. Let 


F(A)=0 (12-78) 
be such a condition. ¥ is a local functional of A, that is, a function of A(x) and 


of its derivatives, it takes its values in the Lie algebra, and it may depend on A’. 
Using the same method as above we find 


X= | (A) [] o[F(A)] det Mep(A) 


up to a normalization. We are thus led to consider Green functions generated by 
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ofa) r= | 9a) [107d ayjderazern [i | atte - 29-0] (12-79) 


A5'(A) = (det Ms) ' = | a [| 64 (?A)] (12-80) 


computed for #(A) = 0. Only infinitesimal transformations contribute to the 
integral and 


Ma(x, y)= 5a sa) F[PA( O]) 


6 [ 6F ee 
~ 6a5(y) Ee vaes,, (12-81) 
OF, 


ee YD at el ee 
6 A#.(x) DE cbO (x y) 


where Dect = On Ocd So Cera AX a(x) 


For instance, if we use the Lorentz gauge 
F(A) =0C,A* =0 
the operator -@ reads 
M1, ar(X, Y) = OD" 5*(x — y) 
= [das + Carcd# Aye(x)]6*(x — y) 
= [Liban + Cade Apel x)O%,]54(x — y) (12-82) 
Returning to the axial gauges 
a= A= 0 


we see that .W is independent of A, when restricted to the constraint manifold; 
hence det .W may be absorbed into the normalization. 


Instead of relying on the canonical hamiltonian quantization and on the subsequent functional 
manipulations, we might decide to modify the ill-defined integral 


= | 2m 


through the insertion of the identity written as 
= | 9(g)A(A) [] 6 F(2A)] 


The gauge invariance of s(A), of A(A), and of the measure 2(A) would then enable us to factorize 
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an infinite group volume 


X= || 20] DA) AVA(A) T] OE F(A)] 


This crude argument has the merit of exhibiting clearly the enormous degeneracy of the degrees of 
freedom as the origin of the problem. 


The previous developments are readily extended to auxiliary conditions of the 
form 


aA) = C 
where ¥ and the given function C(x) take their values in the Lie algebra. This 


modification does not affect the form of the operator ./ in (12-80). 


The variations (3 dg)[F(?4) — C] do not depend on C, and the only dependence of Ag on C comes 
from go, the gauge transformation such that ¥(% A) = C: 


A ¢{A, C) = A(%A) 


We may write 
1= | D(g)A(®A)5[ F(A) — C] = | D(g)ACA)S[ F(A) — C] 


where the second equality holds because the 6 function implies g = go. Any reference to C has 
disappeared. 


Since gauge-invariant quantities should not be sensitive to changes of auxiliary 


conditions, it is possible to average over C with a gaussian weight, ie., to substitute 
for 6[ F(A) — C] the quantity 


es , - 
\ FC) exp E {a tr cc fotze — C] exp a | d*x tr [F Oh 
In its final form the generating function reads 
eG) = few det Wz exp (4 fas {P(x)+Atr[F7(A)] — 2g tr: 4) 


(12-83) 


The perturbative expansion of det .@ leads to nonlocal interactions between 
gauge fields. It is more convenient to perform a final manipulation on this 
determinant, and to reexpress it as a local interaction of fictitious fields. The 
discussion of the integration on a Grassmann algebra has yielded the formula 
(9-76): 


det M = | [] 4a dm, e-7- 
k 


where .@ denotes an n x n matrix. Changing -@ into i.W and extending this 
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result to an infinite algebra, we may write the determinant det Wgasa functional 
integral 


es) =(9 DA, n, ivewr(5 ie [P= 2 [(P*A)| — nay — agit JA) 
(12-84) 
The modified or effective lagrangian 
L AA, N, i) = L(A) + Atr F(A) — 7My (12-85) 


involves the gauge field A and the new anticommuting auxiliary scalar fields y 
and 7, the so-called Faddeev-Popov ghosts. We emphasize that these fields are 
unphysical and only play an algebraic role. To insure global invariance, 4 and 
4 transform according to the adjoint representation. The ghost term in the 
lagrangian (12-85) reads 


oF a 9 A( 
[as 7M gn = [fats a*y ia{x) [ een 


zi atx fe) 29 Dison) ai 


(D"n)a = O’Na so Carchp A". 


In general, the kernel .@ zis not hermitian; therefore the ghost lines in a Feynman 
diagram will have to be oriented. For instance, if # is the Lorentz covariant 
condition ¥ = 0, A", we find 


[as nMn = [as NOs aly (12-87) 


In this case, the lagrangian of Eq. (12-85) reads 
Log =F tt (FF) + Atr (0: A? — 90,D"y (12-88) 


The parameter 4 (also denoted «~' or €~! in the literature) reminds us of the 


arbitrariness of the auxiliary condition. The choices 4 = 1 or 4~ | = O are referred 
to as Feynman or Landau gauges respectively. 


12-2-3 Feynman Rules 


Equation (12-88) gives the desired solution to the quantization problem, since it 

provides a local Lorentz covariant expression for the effective lagrangian. The 

quadratic part in A is invertible as a consequence of the fact that the condition 
= C has picked a single representative in each equivalence class. 

We are now in a position to write the Feynman rules. We rescale the fields 

A and n,n by g, the coupling constant. As the Feynman diagrams also include 

ghost fields, it is helpful to introduce (anticommuting) sources €,€ coupled to 
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n,n. We have thus 
PAC Uatd am | 2. 4.) exp [ar [LP a(A, 1,139, 4) 
+ J"gAua + ata + nets} (12-89) 


= A 
f(A, 959, 4) = —dFwvak a — 5 (OuA"a)” + OyHa(D"n)a 


pope A OATS — gCucA pA’. (12-90) 
(D"n)a = O"Ng ae? 9Carc A" Ne 


The propagators of the gauge and ghost fields are respectively 


k kik 
f = = - 1 a, rt F F) af ae iba Guy = en = il _ aia 
; > ¢ 1 if ( )0®@ I} l 12 Ge (1 A ie Te =| 


(12-91) 
Qde--»--« b (iO)! id ab 


k ih +ie 


5 


and behave as k ~ at large momentum. Therefore, both fields are given the 
dimension one for the ultraviolet power counting. The coupling constant g is 
dimensionless. There are three kinds of vertices. If we orient the ghost lines from 
the 7 to the 7 (as we do for genuine fermions) and include the factor i from the 
expansion of exp [i [ d*z Y,,,(z)], we get 


gCave(27)*5*(p + q + N)[gulP — Do + Grol4 — Nu + Ioult — Pv] 


—ig?(2n)*5*(p + qtrt+s) 
x {CeabCecd(GupGva — JucIvo) 
+ CeacCeab(Gue9 pv — Jur9 oa) 
+ CeadCebe(JuvIap — JuoJov)} 


(12-92) 


Pp 
A 
4" yb pc" 
Mea ae 
rae 
Sod pc " 


‘ 
— gCaveP u(27)*6*(k ae = q) 
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Notice the (expected) asymmetric character of the last vertex. With our 
convention the outgoing ghost line carries the momentum arising from the 
differentiation. For an actual computation, the above rules have of course to be 
supplemented by the prescriptions derived in Chap. 6, namely, integrations with 
the measure d*k/(2z)* of all internal momenta, factorization of the global energy 
momentum delta function, symmetry factors, and a factor minus one for every 
ghost loop. 

Let us complete the Feynman rules when matter fields are coupled in a 
minimal way. We add to the lagrangian (12-90) the terms 


L, = iWDy — mb 
or L, = (D"$)'(D,.d) — m?o'b — P(o') 


for fermion and boson fields respectively. Here wy and @ stand for multiplets of 
fields, transforming according to some representation R of the group, the 
infinitesimal generators of which are the antihermitian matrices T° and P is a 
polynomial. We recall that D* denotes 


D* = 6" — gA",T? 


(12-93) 


The additional Feynman rules are 


B P A i i 
3 ae ( - 7) gba ape wee 
k 
a I(Yu)ap Tis d 9TAB( Pn We Pu) 
an 
x (2n)*5*(p — p’ — k) x (22)*5*(p — p’ — k) 
Pp p’ 
B B A , (12-94) 

, Ha vb kV 


= ig? Juri ie T’\aB 
x (22)*6*(p — p' — k — k’) 


The first column refers to fermions and the second to bosons. In the latter case, 
additional vertices stem from the self-coupling polynomial P(¢‘@). 


For completeness, we also give here the Feynman rules in the axial gauge. To this end, we add to 
the lagrangian a term of the form (n- A): 


A 
Log = 2 |- tPsok, = ; nar? | (12-95) 


The axial gauge is reached by letting 1’ go to zero and, as shown above, no ghost term is required 
in this limit. 
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The corresponding propagator is 


Oe fie 
~~ [-i(0 ~8@8-An@n)]"! 
a 
k* + An?)k,k, k,n, +kn 
aes Suv ( Bey Mylly vily : 
| Ie(k- ny? eken | a 
Fr Juv — (kyny + kyn,)(kon)~) + n(k>n)~*k ik, 
ee er A aes 
A-' 0 ° k? 


The problems raised by the new type of singularity in the denominators will not be investigated here. 
Notice that the propagator behaves as k ~? only in the limit 4~ ' + 0. Finally, the 43 and A‘ vertices 
are the same as in Eq. (12-92). 


12-3 THE EFFECTIVE ACTION AT THE ONE-LOOP ORDER 


The previous Feynman rules lead to a theory renormalizable according to power 
counting, since all propagators behave as k~? and all vertices have dimension 
four. However, the problem is to show that gauge invariance is preserved by the 
renormalization. Before embarking on a long and rather technical proof to all 
orders, it is instructive to perform an explicit computation of the one-loop effective 
action. 


12-3-1 General Form 


To cope with the multiplicity of indices let us use a compact functional notation. 
As in Eqs. (6-73) or (9-107) we obtain the effective action through a Legendre 
transformation 


iT(A, n, 7) = GU, &, &) — i| dt +A + &n + #6) 
(12-97) 
6G 0G ._ 0G 
a id] i= i5z Aas i6e 
Lorentz and group indices have been omitted. The derivatives with respect to 
anticommuting variables are understood as left derivatives. In other words, we 
write 


where 


Gas, ¢) = [ars | ee09 RE + 5&(x) x 


6¢(x) d€(x) 
-- [ars d*y 5&(x)dE(y) eee + 
> 06(y)dC(x) 


This prescription is responsible for the minus sign in the expression (12-97) for 7. 
To lowest order, I’ reduces to the action 


P= Ami = [ax ZalAnia0 (12-98) 
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while the first correction ™"! results from a gaussian integration over primed 
variables of the action expanded to second order: 


a [as LA(A+A nt nt+739,4) 
We write the quadratic form explicitly as 
I, =2 [as tr |i. Ay] — (Dy, A;,])([D*, A’) — [D", A*)) 
—$ Pe Ay, Av] + 5 (0A)? — Oyff[D", 1] 
+ g6uHl Aun’) + 90,'L A”, nlf (12-99) 


where matrix notations are used, 7 = nt’, etc. This fairly complicated form mixes 
commuting and anticommuting variables. We recall the formulas 


n dx; aoe —1/2 ,u07'u/2 
: ~ (12-100) 


| Tl 4: dn; e~F4ntE-n+ae — det MeA 
i 


for gaussian integrals of commuting and anticommuting variables respectively. In 
the mixed case, we get 


dx; 2 " 5 “ z 4 
| Near [ | Gai ani) exp (—4xQx + fax + xBn — Fy + ux + Son + 4-8) 
= det mM [det 0]~* exp 4[u+ Ea-a 
+ (BMG Lu + (EM! a)? + B12] + EME, (12-101) 


The matrices Q and -@ have commuting elements, while those of the matrices « 
and f belong to the Grassmann algebra ; Q denotes the symmetric matrix 


O=Q—BM'a—(BM-%0)" 


Equation (12-101) results from the repeated application of the preceding formulas. An alternative way 
to recover it is to remember that the saddle-point method is exact for gaussian integrals. 


These formulas are applied to the quadratic form (12-99) where 
(QA')" = [D,, [D’, A] — [D", A”]] — oF”, A™] + 2040+ A’ 
+ g{o"n, n'} + g{0"n', n} (12-102) 
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and where 7’ and 7’ are obtained from the equations 
Mn’ = 0,| D",n'| = 90,4", n] (12-103) 
[D*, 0.77] = —9[04n, Au] 


If g = 0, .# and OQ reduce to 
Myo =O Q= U-(1—A)d@d 


the inverse of which are the free propagators. The one-loop effective action, 
normalized to '"(0) = 0 is therefore 


TA, A, n) = Tr [In (Mo*M — 4 1n (605) (12-104) 


The trace is performed on internal and Lorentz indices and on space-time 
variables. 

It is not possible to obtain a more explicit expression. As the matrices Q and 
M act in the adjoint representation, it is more convenient to consider A, and n 
as matrices in that representation [compare with Eq. (12-9c)]. Explicitly, 


(A")ic = AGT “te = CracAa 


(12-105) 
(1) be =a CraeNa 
Therefore, instead of the normalization (12-26), we have 
tr (T°T”) = -y CeaaCeab = — CO” (12-106) 
c,d 


For SU(N), C = N. With these notations, we write 


d*k e &* 1—*2) 


M5 MA, 2) = (1,2) — a | on pz ks Alo) 
d*k 
-1 —_ 


x {(—ik A + +A — gA*) [Gui — (1 — A Da eae (12-107) 
+ [k?A,, —(1 —A7*)k> Ak, V(iky, + gAu,) 
— [k?0,, -(1 —A~")k, k OJ Ay, 
+ [k?F yp. — (lL — 47) Fu, ok kyl} x, oF tire 
In the curly brackets, the argument of A or F is x1, and the derivatives act on 
all x,-dependent terms to their right. The kernel O differs from Q by ghost terms, 


not made explicit here. In the sequel, we use the same notation for a function and 
its Fourier transform: 


4 
Ao | HF e7 *** A(k) 


We shall now focus on the superficially divergent functions. 
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12-3-2 Two-Point Function 


From the expansion of Tr In (Wo '.@) — 3 Tr In (Qo 10) we get three contributions 
to the two-point function (Fig. 12-1): 


;| [dk] tr [A(T (kK) A(—K)] 
= r| [dk][dq] aie x tr [(q + k): Alk)q: A(--4)] 


4]! [ak] [aa] Rate (ad — DAIL) ACW 


+(1—A~*)[q: A(k)q: A(—k) — 7A —k)]} 
“franc cps tla + 9) Ala? - 1-2 Naa] 
—q@[q?A—(1—A~*')g: Aq] — A@[a?q —-(1-—47")q-¢'q] 


+ P(k@A-AQ@k)—(1—A- (kg? A — q@° kA) @ Qh 
x {qq + 4) Alg? —(1—471)q' @q']— 4g @[g?A- (1-4 *)q’ Aq’) 
—A@[q7q-—(l—A")g-d'q)] — g°"(k@A- A@Kk) 
+(1—A~*)[(kq’: A — k-'A) @ @']} -x (12-108) 
In the last trace, q’ = q +k, and the argument in the first curly bracket is 
k and in the second one —k. Tensor notations are adopted for the sake of brevity. 


A Wick rotation has been carried out on the time components of both momenta 
and vector fields: 


k°—ik® = g® -+ ig® A°(k) > iA%k) 


We use dimensional regularization to preserve gauge invariance and [dk] stands 
for the measure d*k/(2z)’. Needless to say, the above expression may also be 
derived directly from the Feynman rules of Eqs. (12-91) and (12-92). 

As explained in Chap. 8, it is consistent, within the dimensional regularization, 
to consider that the second integral of the right-hand side of (12-108) vanishes 


I II 
— -_— 
Sage =? 
(a) (0) 


Figure 12-1 One-loop contributions to the vector self-energy. The broken lines stand for ghost 
propagators. 
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identically. The ghost contribution, ie., the first term on the right-hand side of 
(12-108) is easy to compute in terms of Euler’s functions: 


; d d Qvdii— 9 ay 
Tonlk) = ae (3. )are- jaer( 5 ‘| = oar(2 = 3] (12-109) 


and after tedious algebra we obtain the total two-point function as 


fee. _@ ai ped apenas: dd d d d 
d d d ag od d 
lee! ae | et a ees =f 
cu-sfan(t—12)r(s—2)-20(4—14)r(2-4) 
woe d\esid 
(oe vrar(3—$)a($-1,$-1)h (12-110) 


In these expressions, ['(A) must be regarded as a matrix in the adjoint represen- 
tation, proportional to the unit matrix. 

The ghost contribution (12-109) was crucial to achieve the transversity in k, 
since separate contributions do not satisfy it. The expression (12-110) is suited for 
the extraction of the divergent part as d—4= —e-—0. Recalling from Chap. 8 
that g? should actually be written g*y’, where yp is an arbitrary mass scale, we 
have 


> 


» = : 
Loe (4x)? 


; ve 2 k 
T(k) (k° —k@k[§$+30-—47')] (- ‘ + In z + constant (12-111) 
where constant terms (i.e., independent of k but 1 dependent) have not been 
computed. This expression may be rephrased in an equivalent form, by writing 
explicitly the group indices and returning to Minkowski space. The two-point 


proper function for A*,(k)A”,(—k) reads 


2 2 k? 

Py = — Coy Lege — wus 4a — 4°) -2 +19 (-S)] era 
16x? é U 

It is now clear that the divergent term may be eliminated by the introduction of 

a counterterm: 


SL = (Zs — 1b te [(0,4y — 0 A,)(0"A" — 2° A9]} (12-113) 
2C 2 
where 7 ae +1 -2-)]= (12-114) 


To this order, the term 2/e may be regarded as corresponding to In CTO 
in a conventional regularization. The wave-function renormalization Z3 is gauge 
dependent. This may be traced to the fact that nonabelian gauge fields also play 
the role of charged fields. The fundamental consequence of the transversity of the 
function (12-110) is that no counterterm in A? nor (d- A)’ has been required. 
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Figure 12-2 Three-point function. 


The divergent terms in (12-110) come from the poles of the [ function. However, when the 
computation is carried out in a different way, they may arise as singularities of the B function. This 
is a reflection of the fact that in a massless theory, the distinction between ultraviolet and infrared 
divergences becomes loose. For instance, the term of the integrand of I” proportional to 
(1 — 47 ')6"" has the form 

(k? = q’y? 

q(q + k)* 


and leads, within the dimensional regularization, to an ultraviolet finite. but infrared divergent, 


integral 
id 2 A2\e dig(k?)* d d d 
(et ?) -| ata( Fn a(t 2 con) erm) 
q(k + q) q°(k + q) 2 Z 2/ 
However, after a change of variable, gq - g’ = q +k, this integral looks ultraviolet divergent but 
infrared finite: ~ 


dig ei (q’ - ee (5 d é d ‘dd ad 
—— « 4Bi-—1,-]T|3—~=]-2B =e == 
sil (q — kPq’4 2 2 ( 2 (33 2 } 3) 
Of course, the two expressions coincide, but it is important to pick all singular terms when expanding 
around d = 4. 


12-3-3 Other Functions 


For other superficially divergent proper functions we only give the structure of the 
divergent term. 

Three diagrams contribute to the three-point function (Fig. 12-2). The required 
counterterm reads 


6L43 = (Z; _ i —¢ tr {(0,Ay S 0,A,)[A*, A*}}) 


g°C ae (12-115) 
Zi=1+sea[s+al—4 ie 


mt pk xx 


Figure 12-3 Four-point function. 
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= Sa... —» Figure 12-4 Ghost self-energy to the one-loop order. 


Similarly, the diagrams of Fig. 12-3 for the four-point function necessitate a 
counterterm 


6 Lys = Ze) tay ALLA AD 

(12-116) 
2G 2 
Z.=14+25[-§4+0-a- y= 


The counterterms have the same form as the initial terms of the lagrangian. 
It is of no wonder that this is the case for 6Y4:, as the expression (12-115) is the 
only one which is Lorentz invariant, cubic in the field, of dimension four, and 
invariant under (global) group transformations. This is no longer true for the 
quartic term and the form of the counterterm (12-116) is therefore a gift. 

The counterterms pertaining to functions with external ghosts must also be 
computed. Thanks to the structure of the operator O defined in (12-102) and 
(12-103), or equivalently to the Feynman rules (12-92), the momentum of an 
outgoing ghost line may always be factored out. This reduces the effective 
superficial degree of divergence of functions involving ghost fields, and leaves us 
with only two divergent functions: the ghost self-energy (Fig. 12-4) and the ghost- 
vector vertex (Fig. 12-5). The former does not need any mass counterterm, owing 
to the above property, and we find 


L iin = (Zs = 1)(—1.0Na) 


a oe {1 1—A-*)\2 (12-117) 
Z3=1+ SSS 
16x? \2 4 E 
and OL aAn = (Z1 — 1)(Cave ASO utonc) 
_ 20 4-19 (12-118) 
A ia ies a : oo 
16x“ 2 «& 


In fact Z, reduces to one in the Landau gauge (A * = 0) to all orders, as a 
consequence of the transversity of the vector propagator and of the factorization 
of the incoming ghost momentum. 

In practical cases, matter fields are also coupled to the gauge field. We list 


7 - Figure 12-5 One-loop contributions to the 
ee ees eee Phost-vecton vertex. 
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nee Figure 12-6 Fermion self-energy. 


the counterterms involving spinor fields, as well as the extra contributions to 
6L42, OL, and 6 V4 that they induce. The coupling has been written in Eq. 
(12-93). We use the following notations for the quadratic Casimir operators in the 
representation of the fields: 


tr (°F) = — 176” 


12-119 
TTY = —Cyl ee 

If r stands for the order of the group [number of generators of the Lie 
algebra, for example, r= N? — 1 for SU(N)] and ny is the dimension of the 
fermion representation [n; = N for the fundamental representation of SU(N) ], we 
have the relation 


Dh CG yNy (12-120) 
For the adjoint representation, n = r and hence T = C [=N for SU(N)], while 


T; =4and Cy = (N* — 1)/2N for the fundamental representation of SU(N). 
The counterterms generated by the diagrams of Figs. 12-6 and 12-7 read 


5L jy =(Z2 — IWidy — (2% ie 1) mb 
a g aC; = 2 
= 34 (12-121) 
Mo _ g Cy _ (1 771 2 
2,7 =1- TénzL* (l-—A ie 
and SL jay = (Zire — is 
a1 2s ,_G-* 9] 0, yp 12 122) 
Zir=1 ae ei C+ ACs — 


These expressions reduce to those computed for quantum electrodynamics in 
Chap. 7, if we set Cr = 1, C = 0, g?/4n = a. 

Finally, the modifications induced by an internal fermion loop (Fig. 12-8) may 
be expressed as 


0£(A) ee ae 625 tr (FuvF*’)] 
sf = = O25 U5) = 6Z, (F) __ =e (F) _ Tyg? 42 (12-123) 
16x? 3 ¢ 


The reader will have no difficulty in working out the case where scalar fields 
are coupled to the gauge vectors. 


NONABELIAN GAUGE FIELDS 593 


Figure 12-7 Fermion-vector vertex. 


12-3-4 One-Loop Renormalization 


We have discovered that all counterterms have the same structure as monomials 
of the initial lagrangian. This is not completely sufficient to insure that the re- 
normalized and bare lagrangians enjoy the same symmetry properties. Omitting 
matter fields for a while, we found explicitly 


L + 6L = tr {$Z3(0,,A, — 0)A,)(0"A" — 8°A") + *(a-4)? 


2 
— 921(0yAy — 8yA,)[AY, AY] + £ Zl. Ay, AVILA", AD} 
a Z 30,00" 5 gZ 1(O,%» A“ aNcCate) (12-124) 
We define the bare fields and parameters according to 


Ao=Z¥?7A no =Z3?N og = ZY 


side a (12-125) 
Jo = Z1Z3 j g Ao = AZ3 


Then # + 6# may be regarded as the initial lagrangian (Ao, No, N03 Gos Ao) 
written in terms of bare quantities provided the following identities hold: 


es = 41 = 41 (12-126a) 
Z1 Z3 Z3 

A glance at the expressions (12-114) to (12-118) suffices to convince ourselves that 
they are satisfied to the one-loop order. These identities, which generalize the 
relation Z; = Z, of quantum electrodynamics, express the fact that the coupling 
constant renormalizations of the cubic, quartic, and yA vertices coincide, i.e., 
that the universality of the coupling is preserved by renormalization. In the 


Figure 12-8 Fermion contributions to the two-, three-, and four-point functions. 
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presence of matter fields, fermions, say, we demand moreover that 


Ze_ 21 _ 21 _ Zr (12-1265) 


This relation, too, is satisfied by the expressions (12-121) to (12-123). 
Finally, we write the coupling constant renormalization in the presence of 
fermion fields : 


Go = Z49 


A? (12-127) 
* ic =4T;)) In = ae 


ee 


Notice that the gauge dependence has disappeared. For future use, 2,¢ has been 
replaced by In (A?/y”). 


12-4 RENORMALIZATION 


This section deals with the renormalization of nonabelian gauge theories with an 
unbroken symmetry. What happens as the local symmetry is spontaneously broken 
will be examined later. The issue is to know whether the remarkable features of 
gauge theories, in particular the universality of the coupling constant, will be 
preserved by the renormalization. A regularization is introduced at intermediate 
stages; in practice, the dimensional regularization is the most convenient. The 
desired properties will follow from Ward identities, first derived in this context by 
Slavnov and by Taylor. The reader may find the repeated use of such identities 
in Chaps. 8, 11, and here rather cumbersome. However, nonabelian gauge theories 
are more intricate and require a sophisticated analysis. 


12-4-1 Slavnoy-Taylor Identities 


We start from the generating functional 


eG) = | P(A) det . exp | | as (2 : -— + -A)| 


where .@ is the variation of ¥ with respect to a gauge transformation 
6A = Déa 
a ee (12-128) 
We are going to exploit the property that the measure /(A) det .@ is invariant 
under this transformation, even when 6« itself depends on A. In other words, if 
A=A'+6A and F'(A'))= F[A(A)] = F(A) + OF 
then 


Q(A) det Ms(A) = D(A’) det We(A’) (12-129) 
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The proof is straightforward. We write A(A) for det (A): 


{2a (A) = | 2a #(A)5(A’ — A+ 5A) 
= | emena@anaoa — 9A)A (AOL FPA) — F(A)] 
= | DQAA Ags? AAs? 'A)S[F(A) — FO" AY 


= [2a 


where the invariance of the measure </(.4) and of A.-(A) under a transformation A—°A has been 
used. 


Consequently, 
. A 

ere — | 2 det ./ exp fi ats (v a F?+J-A-AF Mba t+ J- 6x) 
The content of this identity is most easily explored if we take 6a = “~ ‘dw. This 
corresponds to a nonlocal gauge transformation, which translates ¥ by 6m. To 
lowest order in 6q@, we get 

i= i det a | asar — J-DM~*)d@ exp [ | d*x(L—4F? + J-a)] 
or, after substitution of 6/idJ for A, 


bea] [ernnnn(ss)ac(unt sewer im 


where summation over repeated indices is understood. The expression 


x 


i J 


may be regarded as the ghost propagator in the presence of the source J. 
The Slavnov-Taylor identities finally read 


6 5 
LF, F aT = |e = fe [JD (£)) Joreno: x) 


r) 
Me, @| era Gu 3 = é Hons D, a eh) 
b Evel (y, x) baO ( y) 

It is difficult to express in a compact way the Slavnov-Taylor identities (12-1 32) 
on one-particle irreducible functions. A transformation discovered by Becchi, 
Rouet, and Stora enables us to reach this goal. The ghost fields are reintroduced 
in the action 


Gea), x) =Ma* (> x; : a) ee) (12-131) 


(12-132) 


I= [ats (z —_ 5 F — i-n) (12-133) 
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It is easy to show that this action is invariant under a combined transformation 
of the variables 
5AM G(x) = DY an(x)nv(x)dC = sAdl 


Ona(X) = AF aL A(x)]OC = sol (12-134) 
Sna(x) = — : Conene(x)nel x66 = sn dC 


This transformation is local, in contrast to the one encountered above. It introduces 
an x-independent anticommuting parameter df and mixes commuting and anti- 
commuting variables. 


The invariance of I is easy to prove. First ¥ is invariant since 64 Is a special gauge transformation. 
Second, we observe that 


s(-3 7) — (59) Mn =0 


as a consequence of the anticommutation of 7 and 6¢. Finally, 


6(Mn) = 6 oF py | 
5 A 
-0'F OF 
= het jt. is -. é6(D* . 
5a,5A0, n)o(D"n) ado + 5A, (D*n) 


The first term vanishes because (D’y),(D“1), is antisymmetric in the interchange of (ya) and (vb). 
while it is easy to see that 


6(D"y) = 0 (12-135) 
as a consequence of (12-134) and of the Jacobi identity. For later use, we also mention that similarly 
5(Cavetotte) = 0 (12-136) 


The Becchi-Rouet-Stora transformation s is defined as the right derivative of (12-134) with respect to 
6¢, that is, sA= Dn, sna= —gCarcnon-/2. Equations (12-135) and (12-136) imply that s?A =0, 
7a 


This invariance implies identities between Green functions. Let us first show 
how the Slavnov-Taylor identity (12-130) is recovered. We start from 


| 2u. 4, M)Na(x) exp [ (: + | ad yar 4)] 10) 


which results from the odd character in the ghost variables of the integrand. We 
then perform a change of integration variables of the form (12-134); such a change 
does not affect the integral and has a jacobian equal to one, as is easily checked. 
Hence 


= |, n, 1) Ea + NalX) fer Un Dns | exp : (: “7 ja Je 4)] =50 


(12-137) 
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The integration over n, 7 entails the substitution 


Nalx)yo(y) > iM” "yay; x) 
and leads to (12-130). 


12-4-2 Identities for Proper Functions 


The identities are first expressed on connected functions. It is convenient to 
introduce sources not only for the ghost fields but also for the composite operators 
involved in the transformation (12-134). We write 


eHIE8,.K,L) | 2. n,n) exp ji a (z = Se — nn 


= 12-138 
+A + S14 16+ Kod Lon) ( ) 


G(J) = GU, S E, K, DE) ere esto 


Here K",(y) and L,()) are local sources coupled to sAy,a(y) =(D,n)a(y) and 
— SHV) = GCareHol(y)N-(y).2, and are therefore anticommuting or commuting 
objects respectively. More precisely, if 7 is given the ghost number y = —1 (and 
y, +1), K and L have y = 1 and 2. For power counting sA and sy, and hence 
K and L, have dimension two. We assume hereafter that the function ¥ is linear 
in A: 


Fay) = b"abAyr(y) (12-139) 


For instance, the generalized Feynman gauges correspond to $"g, = 0"6a5. The 
use of a nonlinear gauge condition ¥ would require the introduction in (12-138) 
of an extra source coupled to it. 

After a change of variables of the form (12-134), we get 


0= | a4 n, 1) [ats (J-sA + &sn — AF E)(x) exp fifa (-- )| 
since sA and sy are also invariant. This is rewritten as 
6 = r) 2 
4 Pa te ee Aon a gfe) ee GIEEKL) _ 0 
| me E ok °isk (=) oo 


The operator of functional differentiation is linear owing to hypothesis 
(12-139). Hence 


oo wee, 6 a 
eG ee “s -140 
| d*x E a é a NEF ( =) jou. é&& K,L)=0 (12-140) 


In the preceding section, the identity (12-130) was supplemented by an equation 
of motion for the ghost propagator [Eq. (12-132)]. The analog here is obtained 
by expressing that the functional integral (12-138) is invariant under an infinitesimal 
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change 7] > 7 + 64 where 6y is arbitrary. We get the local relation 


| 2u. n, 4) (— Mn + €)(x) exp : | ave =o 


or, since “yn =sF = p:sA: 


G(J,2,2,K,L) _ 
(<- ? = a) e 0 


For connected functions, this reads 


) 


E a 2-141 
5K) G(J, 6, 6, K, L) = (x) (12-141) 


o 


Equations (12-140) and (12-141) must now be translated in terms of proper 
functions. This is achieved after a Legendre transformation 


T(A, n, 7, K, L) = —iG(, €, & K, L) - [atvurd + En +H) (12-142) 


where 


8G 8G es 
“BT gE BE 


or, equivalently, 
~ 6A on , on 


In this operation, the sources K and L are passive spectators: 


6G OF 6G OF 


Pe SK Grae (12-143) 
The identities (12-140) and (12-141), expressed on T, read 
[at aK tart an un foo=0 (12-144) 
OA6K On OL 
iad the sees t Fey = (12-145) 
If the modified effective action I’ is introduced 
r=T+ slaty F*TA(y)] (12-146) 


Equations (12-144) and (12-145) take the simpler form 
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or or of or 

d*x| — — + — — = 
| «(3 5K on =z) ae 

ar or 
Sas ee 

OK* dn 
The identities (12-147) have a universal form. They no longer involve any 
parameter affected by the renormalization, such as the coupling constant, and 
make no reference to the group structure, which has been hidden in the definition 


of the sources. Consequently, they apply as well to the action and are therefore 
suited to study the structure of counterterms. 


(12-147) 


wy) 0 


It is easy to verify that the identities (12-144) and (12-145) are satisfied to lowest order. The first one 
expresses the invariance of the action under the Becchi-Rouet-Stora transformation 


6p 67! 
6A = —5 én = —-—— 6¢ on = AF (A)6 
5K 4 q 5 dC N = AF (AOC 


The condition that the jacobian be equal to one reads 
& yl 671?) 


set 1) 12-148 
546K no ( ) 


and is obviously satisfied. 
It is a good exercise to perform the analogous analysis in the abelian case, in a gauge such as 
F = 0,A" + A?/2 which requires the introduction of ghosts. 


12-4-3 Recursive Construction of the Counterterms 


In the preceding considerations, the theory was implicitly dimensionally regular- 
ized. We now want to renormalize it, in a way which preserves the previous 
identities. As in the one-loop computation of Sec. 12-3, it is convenient to perform 
the minimal renormalization defined in Sec. 8-4-4. In other words, we content 
ourselves with eliminating the divergent terms as the dimension d goes to 4. Order 
by order in fi, we will write 


Fig, = Ty + Ton (12-149) 


where I”), is computed by taking into account all lowest-order counterterms. 
More physical conditions may be substituted, provided they are consistent with 
the identities (12-144) and (12-145). 

Equation (12-147) tells us that all functionals depend on K and 7 only through 
the combination K — 7@. We will use the compact notation 


cleor, ol, a | 
=— | fee 2 SO 2 12-150 
rietas [atx (S BK nai ( 


The identity (12-147) satisfied by the power series in } 
ie = To ak Tt} a T?! J ga 
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reads to nth order 
Pls Ta — 0 (12-151) 


ptq=n 
and we seek counterterms such that the renormalized IY! satisfy 


ert, 0 (12-152) 
pt+q=n 


0 
We proceed in a recursive way. To lowest order I‘! = T°) reduces to 


T° = 1(A,n, 7, K, L) = {as [-L(A) —47My + K-+sA — Lsn} 


2 | d*x[L +(K -19)*sA— Lon] (12-153) 


which of course satisfies (12-152). To first order, we have 


i iG. Si ke I 0 (12-154a) 
T+eT 4 TM s7T=0 - (12-154b) 


Equation (12-154b) is simply the desired identity (12-152) for n= 1, while Eq. 
(12-154a) gives the structure of the first-order counterterm. It is suggested to 
modify J into _ 


Toye ier (12-155) 


so as to cancel the divergent part. However, the recursive procedure works only 
if the renormalized action J, satisfies itself: 


1,*1,=0 (12-156) 
as I does. This is not the case for (12-155), as 
Tat = Vis oe 
However, the right-hand side is of order 7. We are thus led to define 
T,=1-TH +A, 


where the extra piece A, is the integral of a local polynomial in the fields of degree 
four, of order ft”, and defined in such a way that (12-156) is satisfied. Of course, 
it does not affect quantities to order one, and hence saa ate the finiteness and 
normalization condition of rt. 


This complication is quite typical of symmetries involving nonlinear identities. The same phenomenon 
occurs, for instance, in the two-dimensional nonlinear o model evoked at the end of Chap. 11. 


The structure of A, will be deduced from the one of Ii! If we are able to 
show that 


(i — TENA, 0, 1, K, L3 9) = 1(Ao, No. fo, Ko, Logo) + Of) (12-157) 
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where 

Ag = Z47A No = Z4?n fo = 2474 

Ko= 22K Ip =ZEPL go = Zag (12-158) 

‘ with Z3 = 1 + 23h, etc., a natural choice will be 

1,(A, n, 9, K, L;.g) = 1(Ao, no, fio, Ko, Lo; go) 
This new action J, will satisfy 
oe | we 2 — tr i — ), gyagy a “) a ie | 
If, moreover, 

Zp 7 3 (12-159) 


then the condition (12-156) will be fulfilled and the recursive proof can be pursued. 


The aim of the forthcoming technical discussion is therefore to prove (12-157) and 
(12-159). 


We look for the general solution of the equation 
off} = T+ Tl + eT =0 (12-160) 


satisfied by the divergent part I") to a given order n, when all counterterms of smaller order have 
been taken into account. 

The operation o in({ 12-160), which is a generalization of the Becchi-Rouet-Stora transformation 
s, is nilpotent 


(12-161) 


Q 
lt 
(=) 


This is easily seen if o is written as 
aI a ee aI a 
o=—— +e 
Ox; 06; 00; Ox; 


where the commuting and anticommuting variables have been collectively denoted x'and 6 respec- 
tively; in our problem ‘x,’ = {A, L?, '6;! = {n, K}. Then (12-161) results from the identity 
a oe _ (12-162) 
Ox; 00; 
satisfied by 7. Explicitly, 
: (22 a0 aeroa aa aa aa a a 


_————— + — — — — 
Ox; Ox, 08; 06; . 00; 00; Ox; Ox; Ox; 06; 00; Ox; 00; Ox; Ox; 06; 


é (al ) nn) é a) f) 
+ 3x, \ax, 86,) 00, 0, \ Bx; 0,/ dx, 


The first bracket vanishes as a consequence of anticommutativity and the two remaining terms owing 
to Eq. (12-162). On the other hand, any gauge-invariant functional Rin (A) of A only satisfies 


(D*n) a(x) = 9 


ORiny 61 = iG ORiny 


. — 4 = 
oRinv( A) | Cea EK * 5A¥(x) 
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Therefore an expression of the form 
R= RigfA) + OR’ (12-163) 


is a solution of oR = 0. It is possible to show that this is the general solution of the equation for 
s/(A), even when additional sources coupled to gauge-invariant composite operators are added to 
the initial lagrangian. In the present case, however, we can prove it by inspection, using the fact that 


power counting and ghost number conservation restrict the form of I$? to 


Piel [ats {I(A) + [K"a — (76")a]Auas» + $dereLanvnc} 


In this expression, /(A) is of dimension four, A,) is of dimension one, and thus at most linear in A, and 
dare = —dacp are numbers. If we assume that the global symmetry is unbroken (by the choice of 
gauge), then 


Ayab = COpOan + BGCabe Ape 
dave = YGCabe 
with a, 8, and y being pure numbers. Inserting these expressions into (12-160) yields 
p=y 


6l{A) Ge 
iz = CIC ei = 
> BAM, + g(B — a)Car, 3AM, 


A particular solution to the second equation is 


OL 
* 0A", 


~ (A) = (B — a) A* 


and the general solution is obtained through the addition of a gauge-invariant functional of A, of 
degree four, and thus a multiple of Y(A): 


OL 


(A) = aP(A) + (B — a) At 


To summarize, F'! has the form 


Ty = [as jae (A) + (B — a) A", = + a(K — 76)*(Dun)a 


+ (B — a)g(K — 1b)aCarcAnct» + B . LCas | (12-164) 


where a, a, and f are of order fi". Simple algebra and the use of the homogeneity property of Y, 


a 
Pe Gj eee 
0A*,(x) 0g 


enable us to rewrite [% as 


Pi — =o Z # é 6 
ry = (| d*x \(o (Rr *) E a(x) 5A"4(x) ate L,(x) a 


+ S| Kooga eae + Geog? al 
ia ae i | Sas nn K, Lio (12-165) 


which is the desired result. All counterterms arise from a renormalization of the parameters of the 
initial action. Moreover, A and L are renormalized in the same way. If, according to the recursion 
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hypothesis, we write the action renormalized up to order n ~ 1 as 
T,-1 = G42. A, VANE WANE Zuae,K,ZH2_ 41; Zon-19) 
then we have just proved that 


Ty = Tyna — FE + One?) 
7 


= I(ZY2A, Z92n, Z¥29, ZV2K, ZV2L; Z, ng) (12-166) 
with ZY? = 242, — (6 -«+$) 
= 04 
Vig i= Za? aa ) 
a 
Eom = Zaman ae 


This completes the inductive proof. 


We have shown that it is possible to renormalize nonabelian gauge theories, 
while preserving gauge invariance expressed through the Slavnov-Taylor identities 
(12-144) and (12-145). Happily the whole operation boils down to wave-function 
and coupling constant renormalizations. We generate finite Green functions if 
we use the action 


IR(A, ", n, K, cE; 9; A) = I(Ao, No, Nos Ko, Lo > Gos Ao) (12-167) 


with the same notations as in (12-158) and Z, = Z3, A) = Z3 ‘A, which takes into 
account the nonrenormalization of the gauge term (4/2). F(A)’ (for a linear function 
F). AS a consequence of (12-167), the Green functions are multiplicatively 
renormalized: 

PR(A,n, 9, K,L; g, 4) = V.g(Ao» No, Mo, Ko, Lo ; go; Ao) (12-168) 
and satisfy the identities (12-144) and (12-145). After completion of this proof, 
we are of course entitled to drop the auxiliary sources K and L in the two 
preceding equations (12-167) and (12-168). 

The previous considerations extend to cases involving matter fields, coupled 
in a minimal way. If fermion fields are present, we assume for the time being 
that the couplings involve no A; matrix (this point will be investigated in Sec. 
12-4-5). Dimensional regularization makes the theory finite, the Becchi-Rouet- 
Stora transformation and the identities (12-147) can be generalized, and, as 
expected from our one-loop computation, equations analogous to (12-167) and 
(12-168) hold. The crucial feature is, of course, the universality of coupling constant 
renormalization. 


The compact formulation of Ward identities may have obscured simple facts. We emphasize that 
the results found in the one-loop computation are mere consequences of the Eqs. (12-147). For 
instance, if the counterterms Z,,Z;,Z4 are reinstated as in (12-124), it is easy to see that (12-167) 
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means that - 
Zi Lae 4 


Zs) 250 2a 
to all orders. Equation (12-147) may also be used to show that the corrections to the inverse 
propagator I, are transverse to all orders. 

A different regularization, investigated by Lee and Zinn-Justin, relies on the introduction of 
higher covariant derivatives in the initial lagrangian. This improves the large-momentum behavior 
of the gauge field and makes all diagrams with more than one loop finite. The one-loop diagrams 
must be regularized independently in a gauge-invariant way. 

The reader may carry out the renormalization program in the axial gauge; it is convenient to 
write the condition n- A = 0 by using a Lagrange multipler in the functional integral. Although the 
ghost fields are not really coupled to the gauge field, their introduction enables us to use a slightly 
modified Becchi-Rouet-Stora transformation and to derive a set of identities. 


12-4-4 Gauge Dependence of Green Functions 


Since at the very end of our computations we are supposed to check the gauge 
independence of the physical quantities, it is important to contro] the gauge 
dependence of Green functions. We observed that the transformation 6A = 
DM~*dq shifts F¥ into F + dw and modifies the action I, of Eq. (12-133) 
according to 
6(6@. 
I¢ + dl =I¢—- [as |A%.00. de je ons | 

b 

= 1g + 50 (12-169) 


Consequently, an infinitesimal change of the function ¥ may be cancelled by a 
gauge transformation on the field A. The latter affects only the source term, and 
according to the equivalence theorem of Sec. 9-2-1 this should not modify physical 
quantities such as the S-matrix elements: 


exp [G¢+as(J)] 


= | 214) ae Mgsssexp4i| dx] 2 = <(F om ary+J-a| 


a 


= [2 det Ws exp fi| ats E ~ 4 F*? +J°(A—-DM all 


(12-170) 


This discussion is quite formal at this point, since in the gauge theories studied 
so far the S matrix is not defined, due to severe infrared divergences. 


In terms of the Becchi-Rouet-Stora transformation, the preceding property is reflected by the 
following structure of 6/: 


OAF, 
6A", 


61 = — [as Eze + Na (D* | 


Ears [nar d*x 
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This enables one to study the gauge dependence of proper functions and counterterms, for instance 
their dependence on the parameter 4. As a typical result, one may show that the coupling constant 
renormalization Z, is A independent, at least in the minimal renormalization. With other prescriptions, 
this may be wrong. The last remark is a hint that the physical interpretation and observation of a 
nonabelian gauge coupling constant may be difficult. 


12-4-5 Anomalies 


We now reconsider a case of physical interest set aside at the end of Sec. 12-4-3. 
Let us assume that fermions are coupled to the gauge field through an axial current. 
We have seen in the previous chapter that anomalies may occur in the conservation 
(Or quasiconservation) of such a current, as a consequence of the impossibility of 
regularizing the theory while preserving chiral symmetry. In the instances studied 
in Chap. 11, namely, quantum electrodynamics or the o model, this anomaly was 
acceptable and of physical interest to analyze the process n° — 2y. If the gauge field 
(abelian or not) is coupled to an anomalous current, the situation is drastically 
different. Slavnov-Taylor identities may become invalid and renormalizability is 
jeopardized. In theories where the gauge field remains massless, such as those 
considered so far, this would mean that all possible counterterms of dimension 
four would be required, spoiling the universality of the coupling renormalization. 
The issue is much more crucial when the symmetry is spontaneously broken. As 
we shall see in the next section, the gauge field becomes massive and re- 
normalizability only results from the underlying gauge invariance. Anomalies are 
then harmful, and it is possible to devise models where they make the theory 
nonrenormalizable. It is therefore important to find a criterion to rule out their 
appearance. 

Asin the previous chapter, the analysis may be restricted to one-loop diagrams. 
There exists a gauge-invariant regularization which preserves chiral invariance at 
higher orders (Sec. 12-4-3). Consider a gauge theory based on a compact group. 
In other words, we admit abelian factors. The lagrangian, including fermion fields, 
is 


L = —$F yaF”. + Wid — m— ig ATW (12-171) 


where I? stands for some combination of T* and T“ys. It is easy to check that the 
anomaly of the axial current 


W)n)s Ty 
as given by Eq. (11-225), is proportional to the combination 
dape = tr (T*{T”, T*}) (12-172) 


The vanishing of this quantity may be realized by each species of fermion (each 
representation) coupled to the gauge field; this is in particular what happens for 
real representations where the matrices T are antisymmetric: 


eee (hon 7} 7) — —d,,=0 (12-173) 
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But the condition may also result from a cancellation between different species. 
This will be illustrated in Sec. 12-6-4. 

The harmful anomalies are those occurring in axial currents coupled to gauge 
fields. For instance, ifthe combinations /y,75°w are singlets for the internal group 
G, the anomalies of such currents are of no importance here. The matrices /* might 
refer to a different set of quantum numbers, e.g., flavor as opposed to color. Tike 
corresponding anomalies are proportional to 


dare = tr (A%{T*, T}) = tr A¢ tr {T*, T} 


which vanishes in the case of an SU(3) symmetry, for instance, since tr 4* = 0. 
Only the U(1) current 


Jus = WyuysW (12-174) 
is anomalous: 
O"Jys = 2impysW + CgEpvpcF oF a 
= 2imbysh + 2Cg?O"EpvpeA"alO? Ag — 0°APa — $GCave A, A”) (12-1744) 
with C = — T;/ 162”. On the other hand, there exists a conserved current 
Jus = Jus — 2CgEpvpeAa(O°A%a — 0°A%ag — 3GCaveA’y,A%) (12-175) 


but it suffers from the lack of gauge invariance. 


12-5 MASSIVE GAUGE FIELDS 


12-5-1 Historical Background 


The nonabelian gauge theories considered so far enjoyed an exact local symmetry, 
and consequently the gauge field was massless. Such theories are used nowadays 
to construct models of strong interactions. However, historically, after the intro- 
duction by Yang and Mills of nonabelian gauge fields, physicists have struggled 
for years in order to build a meaningful theory of massive gauge fields, hence 
breaking explicitly the local symmetry. 

A strong motivation came from the study of weak interactions. We recall from 
Chap. 11 that the current-current Fermi theory provides a remarkable phenomeno- 
logical framework. The weak interaction lagrangian (or up to a sign, the 
hamiltonian) was written (compare with 11-62) 


G 
Fin = — Far sie (12-176) 
This is, of course, a zero-range interaction. 
In spite of its successes for low-energy processes, this model suffers from 


serious problems. As the dimensionality of the coupling constant G shows, the 
theory is nonrenormalizable. Alternatively, power counting assigns dimension six 
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to the product J"J,". At high enough energy we cannot content ourselves with 
the Born approximation. In order that a scattering amplitude satisfies the unitarity 
condition, at least perturbatively, higher-order terms must be added. These 
corrections, however, are plagued with ultraviolet divergences, the elimination of 
‘ which introduces a growing number of arbitrary parameters. In practice, the 
nonrenormalizability makes this computation impossible. 

Another aspect of the same problem arises when we consider the Born 
approximation for some cross section o. On dimensional grounds, we expect at 
high energy the behavior 


o ~ constant x G’s (12-177) 


where s is the total center of mass energy square, while in every partial wave the 
unitarity limit reads 
constant 
o~ ———— (12-178) 
S 
Therefore, we expect a violation of unitarity to arise at energies of the order 
/s ~ G1? ~ 300 GeV. 


It is a good exercise to compute explicitly the constant appearing in (12-177) and (12-178) for such 
leptonic processes as v¥ > vv, ).e7 > ¥,e", and v,e7 > Voy”. 


Both aspects, nonrenormalizability of the theory and bad high-energy behavior 
of the Born approximation, are manifestations of the same phenomenon. This is 
obvious if we use dispersion relations to compute a one-loop contribution to some 
elastic scattering amplitude in terms of its discontinuity, ie., in terms of some 
Born cross section. The behavior of the latter results in severe divergences in the 
dispersion integral. 

It is therefore mandatory to transform the Fermi theory into a respectable, 
i.e., renormalizable, field theory. A tempting hypothesis consists in introducing a 
charged vector field W, and coupling it to the current J,: 


Line = GI x)W,"(x) + hie (12-179) 


The analogy with electromagnetism is evident. The intermediate boson represented 
by the field W would be the quantum of weak interactions. To explain the 
validity of the Fermi theory at low energy, we assume W to be very massive. The 
implications of (12-179) would depart from those of (12-176) only at high energies. 
Let us consider, for instance, the decay. The Fermi theory gives the amplitude 
(Fig. 12-9a) 


= © pedro — ys)v(pr,)a(pr,)y°(L — ys )ulPw) 


f 
whereas the W boson contributes 
gh? — k’k"/Miy 


| 22 U(py,)yo(1 — ys)u(Pu) 


ig’ti(pe)y A1 — ys)v(ps,) 
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(a) (b) 


Figure 12-9 Muon decay (a) in the Fermi theory and (b) as mediated by an intermediate vector boson. 


The two amplitudes coincide at energies such that k* = (p, + p<)’ « Mwy, provided 
ieee (12-180) 


Similarly the Born approximation to a scattering amplitude such as wi wv 
would be reduced at high energy by a factor M/s with respect to the Fermi 
amplitude. The violations of unitarity have seemingly been eliminated. Actually, 
the amplitude vi - W*W_ still has a bad behavior. This points to the necessity 
of introducing more fields and more couplings. 

The dynamics of these massive vector fields have now to be prescribed. 
Particularly troublesome is the question of renormalizability in view of the large 
momentum behavior of the propagator: 


( wv Ky =) — Mi)"* ~ constant 
Mw k? >My 
However, we remember that in quantum electrodynamics the introduction of a 
photon mass has not spoiled renormalizability. If we adhere to the prejudice that 
more symmetry in a theory reduces the number of divergences, it is suggested to 
consider the W, as a member of a set of gauge fields. A candidate for a symmetry 
group is SU(2) and the local invariance is of course explicitly broken by the 
mass terms of the W. This would lead to a universal coupling of the W. Since 
SU(2) gauge fields must form a triplet, a third neutral vector field has to be 
introduced. Such a model, suitably amended, will be shown to be renormalizable. 


For completeness, we mention a different historical reason for introducing massive vector fields. It 
was suggested in the early 1960s to base the theory of strong interactions on a gauge principle. The 
invariance group was SU(3) x U(1), corresponding to the eightfold symmetry and baryonic charge 
conservation. The vector bosons—massive gauge fields were identified with the existing p, K*, «, 
and ¢ particles. 

An interesting property of such a model is that the forces were attractive between particles of 
antiparallel isospins and repulsive for parallel isospins, a generalization of the electromagnetic 
attraction between opposite charges. Such a property is in agreement with experimental facts at low 
energy. 

To see this, let us compute the elastic scattering amplitude of two scalar particles belonging to 
the real representations (1) and (2) of a simple Lie group. The lowest-order contribution (the exchange 


NONABELIAN GAUGE FIELDS 609 


t 


OU oe) 


(2) "5 6 


Figure 12-10 A scattering amplitude to lowest order. 


of the vector field as shown in Fig. 12-10) comes from the following terms of the lagrangian: 


F = UDP (DMG) + HOPG) (DG) (12-181) 
where DM — g# — gA*, Ti T” antisymmetric a = 1,...,7 
and reads 
s-u 
7 =7 —— », tats (12-182) 
Vig a=1 
The quantity 


Xena = Se TEY TR 
a=1 


must be projected on irreducible representations. To this end, we introduce the Clebsch-Gordan 
matrices for the product of the representations (1) and (2). If n,; and nz are the dimensions of the 
representations, the n,; x m2 matrices M4 satisfy the orthogonality and completion relations 


tr (M4M*) = 6542 A, B= 1,..., mmo 


12-183) 
MoAMOs = baa’ f ( 
2 B B BB 
and transform according to the (t) representation 
; TOMO _— MOAT) — THeyoP (12-184) 
It is then easy to see that 
Xens= TST" 
=— Y Mepoeyos ror 
a,A.t 
1 
=-» [cli)+ cQ)—cwo|MomMy (12-185) 
5) B y 
ely i 


in terms of the Casimir operators for the representations (1) and (2) and (t) [see (12-119)}. The value 
of the Casimir operator increases with the dimension of the representation. For instance, in the case of 
SU(2), the representation of isospin I has C(/) = I{I + 1). The desired property follows from the 
positivity of (s — u)/(M ? _ t) in the physical region. 

This calculation of a crossing matrix, relative here to internal degrees of freedom, is analogous 
to the Fierz reshuffling of Chap. 3. 

This model, considered by Sakurai, also predicts the widths of vector bosons. Neglect the 
¢@ — w mixing (which improves the computation) and couple the (a, K,) and (p, K*, @) octets in a 
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gauge-invariant way. The p-wave amplitude of the elastic mz, K, and KK channels in the Born 


approximation reads 


ge ag? 


ras 


al = 
Pie 


where q is the center of mass three-momentum, and « = $, |, and 2 for the three channels respectively. 
A unitary amplitude t,; may be constructed in the form 


7, 


= ——____ Im t; = p|t;|? 
(aes: 1 p|ti| 


ty 


where p = 2(q|/\/s (x 4 for mm due to the identity of particles) is the phase space factor. This 
amplitude exhibits a pole at s ~ M? — ipaq’g?/16x corresponding to a resonance of width I ~ 
paq’g?/16xM | awe Adjusting the value g?/16x = 0.63 yields T, ~ 130 MeV, Ine = 38 MeV, and 
Ty ~ 4.5 MeV, which reproduces fairly well the experimental! values (125, 50, 3.2 respectively). 


12-5-2 Massive Gauge Theory 


Is a gauge theory where mass terms are introduced by hand renormalizable? 

In electrodynamics, the situation is favorable. After separation of the gauge 
field into transverse and longitudinal components, the longitudinal part k,k,/M? 
which gives rise to the bad behavior in the propagator does not contribute to 
the § matrix. This results from the noninteraction of longitudinal and transverse 
components and from the coupling of the field to a conserved current. In a non- 
abelian theory, none of these properties is satisfied. Longitudinal and transverse 
parts do interact, while the current to which the gauge field is coupled is not 
conserved. On the other hand, unexpected cancellations of divergences at the 
one-loop level make the theory look like renormalizable. This explains why it 
took some time to reach a consensus, namely, that the theory is not renormalizable. 
The way out of this unpleasant situation is to appeal to the mechanism of 
spontaneous symmetry breaking, to be explained in the next subsection. 


The aim is the construction of a renormalizable theory with the requirement that the physical states 
be the massive vector fields only. If we use auxiliary fields, as in the Stueckelberg method for the 
electromagnetic field, we must check that only the three physical degrees of freedom of each massive 
vector boson contribute to unitarity. In the forthcoming method, where local changes of variables will 
be performed in the functional integral so as to improve the behavior of the propagator, this 
requirement will be fulfilled by virtue of the equivalence theorem. We thus consider the generating 
functional 


et) a [2 exp i [as [Y-2tr aay} (12-186) 
where we use again matrix notations 


L = tr ($F,)F” — M?A,A*) (12-187) 


The canonical quantization may be in trouble, since 19, = 5 Y/509A°, = 0. However, the presence 
of the mass term insures the existence of a propagator, and hence the definiteness of the previous 
functional integral in a perturbative sense. The Faddeev-Popov operation is not necessary, but we do 
it nevertheless, in order to improve the behavior of the propagator. We choose a gauge condition 


A= 
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as in Sec. 12-2-2 and insert in (12-186) the identity 


l= | 2011 5[ F (2A) — C] det Ms(A) (12-188) 


We obtain 


aati [vung T] of FA) — C] det Ws exp | d*x[Y —2tr(J- ai} 


In contrast with the massless case, ¥ is no longer invariant under the gauge transformation 
A — %A._ If we parametrize g(x) as 


glx)" 


it is easy to show that 
2 1 
tr (4"A,) = tr | 9A" 9A, —— 0,2 94" + — d,EPH(A, 2) 
g g 


where P¥” stands for the formal series 


2 D>) 


feds biaye— SW [oa Beata [ir Bree 12-189 
(A= 3 oy 10th fl (12-189) 
the generic term of which has n brackets. The § matrix is not affected if we replace the source term 
J: A by J°9A. After a change of variable A*~'A and a gaussian integration over C, the new 
generating functional reads 


eo) = [ a0, En, fi) exp { | d*x[2A, £1, i) — 2tr(J- ay} (12-190) 


e 


It involves a lagrangian with the field A,, the Faddeev-Popov ghosts 1,7, and a new field €, with 
conventional commutation assignments 


M? M? 
LA, €. = or Ea = WI AWAY + 9 rA(A) + 2 eed - a 6,EP*(A, 2 | — fee 


(12-191) 


If M? = 0, the field = disappears from the lagrangian and the € integration gives an (infinite) factor 
which does not contribute to e@)~ ©. 

It is convenient to choose the Landau gauge, ¥ = ¢, A", 2 > &, to compute the vector propagator 
— i(Yuy — Kuky/k?)(k? — m?)~*. The € and » propagators behave as 1/k’. The superficial degree of 
divergence of an L-loop diagram is 


SIE ia; 2S 22 


where n; is the number of vertices of type (i) and d; is the number of field derivatives on such a vertex. 
For a proper diagram with E external lines (among which no € line), this yields 


wo<2L4+2-—E+ >, nd; —2)<2L+2-—E (12-192) 
internal 
verlices 


to be compared with 
o<6L+E-—2 


in the original theory (12-187). 
Observe that for an abelian theory, P, = 6,€, the fields & and y are not coupled to the vector 
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‘ f Figure 12-11 A diagram with a quartic divergence, in a massive gauge 
ger theory. The dotted lines represent auxiliary field € propagators. 


field. We have just recovered the result of Chap. 8, namely, that massive electrodynamics iS Tre- 
normalizable by power counting. 

In a nonabelian case, if we restrict ourselves to the one-loop approximation, Eq. (12-192) gives 
the same superficial degree of divergence as in a renormalizable theory, w = 4 — E. To this order an 
effective lagrangian for diagrams without external é or n lines reads in the Landau gauge 


M2 
Y',=tr (s.r — M7A,A* — gD") — 70,D*"ny (12-193) 


The gaussian integrals over & and 7,7 may be performed. The former yields det 12 As while the 
latter gives det M . Hence, to that order, a single auxiliary field suffices, with the prescription that 
a factor —4 is attached to each closed ghost loop. The presence of this factor —4 to be compared 
with a factor —1 in the massless case shows that the limit M + 0 must be singular. We have seen in 
Sec. 12-3 that the ghost contribution (with a factor — 1) was crucial in the massless case to maintain 
gauge invariance. We thus expect the modification of the prescription to modify the counterterms. 
For instance, it leads to mass or gauge term renormalizations, and, more seriously, to other four-point 
couplings. Therefore, even though the theory looks renormalizable to this order, the symmetry of the 
counterterms is lost and serious difficulties occur to higher orders. For instance, according to Eq. 
(12-192) the diagram depicted on Fig. 12-11 has a quartic divergence, w = 4. 

We thus conclude that in spite of cancellations of divergences, the massive gauge theory is not 
renormalizable. 


12-5-3 Spontaneous Symmetry Breaking 


We have studied spontaneous symmetry breaking in Chap. 11, where boundary 
conditions allow us to choose among a set of degenerate ground states. A 
remarkable feature of this phenomenon, in the case of a continuous symmetry, 
is the appearance of massless particles. These Goldstone bosons are the zero-energy 
excitations connecting the possible vacua to each other. It is natural to reexamine 
this phenomenon in a gauge theory (abelian or nonabelian) where long-range 
forces are present or, alternatively, where there may exist an unphysical sector in 
the Hilbert space. It turns out that in the presence of a broken gauge symmetry, 
the long-range forces are screened. The Goldstone bosons and the gauge fields 
conspire to create massive excitations, and the massless excitations are un- 
observable. 

This phenomenon was discovered and studied in the context of super- 
conductivity. Electron pairs responsible for superconductivity may be described 
by a wave function y = pe". The charge density, proportional to w*w = p?, 
must be constant throughout the crystal to neutralize the background charge of 


the ions. In the presence of a magnetic field with vector potential A, the current 
J reads 


i pies 5 Os 
meme \q | OTA |e 
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where q = 2e is the charge of the pair. The divergence of J vanishes, and hence 
in the transverse gauge V-A=0 we have A@?=0. For a simple geometrical 
configuration, 6 is constant; thus the Maxwell equation 


Ve 


leads to 


A (12-194) 


The vector potential A is screened on a characteristic length J, A = J m/ |qp|. 
This is the Meissner effect which prevents a magnetic field from penetrating a 
superconductor. 

Returning to field theory consider a charged field @ coupled to an abelian 
gauge field. The lagrangian is 


# = —aOyAy — CvAy)(C*A® — BAY) + (0, — ie A,)b*(6" + ie A") — V(9) 
(12-195) 


The potential 1(¢) is invariant under local transformations ¢— e'¢ and its 
minimum occurs for a nonzero value of ¢*@ (compare with Fig. 11-5). For 
instance, 


V(b) = uw o*h + A(P* Ob) (12-196) 
with p? < 0, A> 0. In the ground state <g> = v//2 with 
2 + Alo? =0 (12-197) 


A global rotation may always transform <#> to a real value, since it is x 
independent. We assume, therefore, 


grea tae 
, Wp 


and translate ¢: 


iy=v <d2>=0 (12-198) 


di(x)= 9'1(x) +0 <> =0 (12-199) 
In terms of ¢’ (the prime will be dropped), the lagrangian reads 


GP = —4(0,A, — 0,A,(O*A” — 8A") + 40h 10" b1 + 8x b20"h2) 


2 4? 
OK (62 + 63 + 20d: + 0”) 


2 
— "(t+ $3 +2061) +5 


a “(63 + $3 + 2d: + v7)? — €A,h2 6,61 + evA*Gyh2 (12-200) 
By virtue of Eq. (12-197), the coefficient of $3 vanishes ; $2 is a Goldstone boson. 


But wait! a mass term for A, has appeared: $e7v? A’, together with a mixed 
term evA,,0“2. The quadratic form in A and ¢2 is diagonalized by the combination 
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Bee 4, + ee (12-201) 


which looks like a gauge transformation. Then, the quadratic part of 7 1s 
L, = —4(0,B, — 0,B,)(0"BY — OB") + 50, "Gs + Ze7e7 B? — (270°) $i 
(12-202) 


The net result is the following. The gauge field has acquired a mass while 2 
has disappeared, at least from Y,. In fact 2 can be eliminated altogether if we 
use a different parametrization of @: 


p(x) = ell” yaa) (12-203) 


j2 


with hermitian fields v and p. After a local gauge transformation 


! — hue eg 
o-oo =i? 8 ee 
J (12-204) 
i lag 
Ay > Aly = Ay + 7 Cul 


the lagrangian takes the form 


G = -1(0,A', — 3,A',)(04A” = 8A”) 


+ (0, — ieA’,)b'(O" + ieA’)d’ — po? — Ad’* (12-205) 


Expanding ¢’ = (9 + ayy we verify that 4 has indeed acquired a mass M = |e|v 
and that the Goldstone boson has disappeared. We started from a system describ- 
ing a charged scalar field (two states) and a massless gauge field with two 
polarization states. After spontaneous symmetry breakdown, we have one real 
scalar field and one massive vector field with three polarizations. The number of 
degrees of freedom has been conserved and the Goldstone boson has been trans- 
muted into the longitudinal polarization state of the vector field. This is the 
phenomenon discovered by Englert and Brout and by Higgs. The remaining 
massive scalar boson is referred to as the Higgs boson. 

The previous mechanism may be extended to a nonabelian symmetry. We 
follow the analysis of Kibble and Lee and Zinn-Justin. Let G be the gauge group 
of order r, not necessarily semisimple, and let the gauge field be coupled to a 
multiplet of scalar fields ¢, transforming according to some irreducible n- 
dimensional representation. The lagrangian is 


L = — dF yygF a + [On — GaT“Aua)O| "(On — gaT Ayal] — Vid) (12-206) 


The antihermitian T matrices are the infinitesimal generators of the representation 


and the coupling constants may depend on the simple component of the group. 
Finally, V is assumed G invariant and its minimum is reached for <@> = v. Let 
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H be the subgroup of G (of order s) which leaves v invariant. The nature of H 
depends on G, on the representation chosen for ¢, and on the form of V. 


Examples 
‘ (a) G = O(n), ¢ is the vector (n-dimensional) representation, H = O(n — 1). 
(b) G = SU(3) x SU(3), ¢ = M is a3 x 3 matrix belonging to the (3, 3) representation, and 


V(M) = « tr (MM")? + B[tr (MM")]? + y(det M + det M‘) + 6 tr (MM") 
The reader will find that for hermitian <M), H contains at least SU(2) x U(1). 


Let the infinitesimal generators of H correspond to T° (a= 1,...,s). The 
remaining generators T° (a=s+1,...,r) generate the coset space G/H. We 
parametrize (x) as 


Q(x) = exp i aed [p(x) + vy/./2 (12-207) 


a=srt+1 


where p and & have a vanishing vacuum expectation valuet. The p field has 
n — (r — s) effective components. In the absence of gauge fields, the € would be 
the Goldstone bosons. However, gauge invariance enables us to eliminate them. 
We perform the transformation 


p(x) > o'(x) = exp -5 ie: uo) (x) = [p(x) + v]//2 


a=st+1 
PEC) | (12-208) 
v 


 T%Ca(x) 
d oe | 


r 
Siok 


yi Gia" Agalx) Te > gal A jas) = exp | - 
a=1 a=1 


x - a.+ > soT*Aul) exp 
a=1 


Ss 


which changes (12-206) into 
~ GA.) = —3F was + [DA 6')'[D( 496] — Vib) (12-209) 


where DD, = oO ma Y GAyal” 
a=1 


Any reference to the would-be Goldstone bosons € has disappeared, and the 
hermitian nonnegative mass matrix of the vector fields is 


(M?)qp A" aA» = Jago Tv)'(T°v)A*a Aus (12-210) 


Since the s first generators T* (a = 1,...,s) map v onto zero, the matrix is block 
diagonal. Only the lower (r -- s) x (r —s) matrix is positive definite and cor- 
responds to massive vector bosons. The s remaining massless fields correspond 
to the unbroken H gauge symmetry. The total number of degrees of freedom is 
conserved, as in the abelian case. 


t The factor fe appearing on the right-hand side of Eq. (12-207) will be omitted when discussing 
real fields, as was the case in Chap. 11. 
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We may now reexamine the semiclassical picture sketched in Sec. 12-1-3 and look for solutions of 
the equations of motion for a theory involving gauge ficlds and scalar fields. Nontrivial static 
solutions of finite energy do not exist for gauge fields or scalar fields alone. However, theories 
involving both scalar and gauge fields may possess interesting classical solutions in three space 
dimensions. 

For a static solution of finite energy, the fields must tend at spatial infinity toward one of the 
lowest energy configurations. Otherwise the energy density would differ from zero by a finite amount 
in an infinite domain. A possible way to insure stability of a nontrivial solution is the existence of 
a set of degenerate vacua. We may then assume that the fields tend to different vacuum configurations 
in different spatial directions at infinity. The solution, if it exists, will be topologically stable if it 
maps in a nontrivial way the S$, sphere at spatial infinity onto the manifold of possible vacua, 1e.. 
the coset space G/H. A sufficient condition is that the symmetry 1s spontaneously broken and that 
the homotopy group z2(G/H) is nontrivial. 

For definiteness, let us consider the Georgi-Glashow model, which is a gauge theory of symmetry 
group G = SO(3), where a triplet of scalar fields is coupled to the triplet of gauge fields. The symmetry 
is spontaneously broken into U(1) = SO(2). If the remaining massless gauge field is regarded as the 
electromagnetic field, we have a model of quantum electrodynamics based on the group SO(3). With 
this in mind, we call the coupling constant e in the sequel. Mathematicians teach us that 
%2[SO(3)/U(1)] = Z, the group of integers, which means that the solutions are characterized by an 
integer topological charge n. 

*t Hooft and Polyakov have studied the n = | solution. This corresponds to a boundary con- 
dition such that ¢, points along the normal to the S, sphere at infinity [identity map from S, onto 
SO(3)/U(1) ~ Sz]: 


s be), > oe (1225 


x00] 


where v is the vacuum expectation value of the real field @. If we choose the gauge 4? = 0, imposing 
that D@ vanishes asymptotically, leads to 


Xb 


| 3 
Ag = Flap — (12-212) 
e x 


It is then possible to show the existence of regular solutions @,(x), Aa(x) satisfying these boundary 
conditions. 

The topological invariant n may be interpreted as a magnetic charge. For this time-independent 
solution the electric field vanishes. At infinity the magnetic field is radial: 


& 

A x 

P= ies 
3 

er 


since it is obtained from the vacuum configuration ¢@> = v through a gauge transformation. 
Its flux through the surface is 


1; 1 An 
[pas=1[ 4 ae (12-213) 


by definition of the magnetic charge g contained inside the sphere. Solutions of higher topological 
charge n carry a magnetic charge 4zn/e. 

As any semiclassical configuration of this kind, the energy of the configuration—i.e., the rest 
mass of the monopole—is proportional to the inverse square coupling constant, 1/e?. *t Hooft has 
shown that it is of order My/a where My is the vector mass acquired through spontaneous symmetry 
breaking. Such a monopole would be extremely heavy! 
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12-5-4 Renormalization of Spontaneously Broken Gauge Theories 


It is clear that the gauge (12-209), the so-called unitary gauge as it involves only 
physical degrees of freedom, is not suited for the study of renormalization, since 
the propagator has a bad large-momentum behavior. We need to use the initial 
gauge where renormalizability is more obvious, but then we will have to show 
that unphysical states do not contribute to the § matrix. 

The key idea is the same as in the case of the spontaneously broken o model 
(Chap. 11). Renormalization is independent of whether the symmetry is exact or 
spontaneously broken. For simplicity of notations, we present the analysis in the 
case of G = O(n) and for a real scalar vector multiplet ¢. The complete lagrangian 
including gauge and ghost terms reads 


“ — ts aa 
Ls?) = —AFyveP a — 5 (uta? ~ 90,D'n + 4D, 6D" — ©? — 2 (g2? 


(12-214) 


The ¢$* coupling constant has been denoted Ay to avoid confusion with the gauge 
parameter. For yu” < 0, the field ¢ acquires a vacuum expectation value (> = v. 
We want to show that the counterterms of the symmetric theory (j.? > 0) suffice 
to make finite the broken theory (u? <0) up to a renormalization of y?. As in 
Chap. 11, we cannot blindly continue p” from positive to negative values, because 
u? = 0 is not an analyticity point. A phase transition occurs at this point. It is 
safe, however, to introduce a small external source ¢ coupled to the field @(x) 
and constant throughout space 


L = Lp?) + c+ G(x) (12-215) 


This explicit breaking induces a vacuum expectation value v of @ parallel to c. 
Indeed, an identity derived for the linear o model [Eq. (11-170)| remains true 
here. If (0) = —m? denotes the inverse propagator of the transverse component 
gr at zero momentum, then 


¢= —vI' (0) (12-216) 


To lowest order, this condition reads ¢ = (u? + Agv*)v and expresses that the 
vacuum expectation value v = <@> makes the lagrangian (12-214) stationary. 

The proof of renormalizability relies on two observations. First, as in the 
linear o model, the generating functional of proper functions for the symmetric 
and the broken theories are related by 


I'(@, ¢, v) = T's(@ + v) — Ts(v) + | at c-o (12-217) 
or 
with — c= — a (12-218) 


The other arguments of I and Is, such as A,,7, 7, g,..., have been omitted. We 
stress that I is the generating functional of Green functions of fluctuations around 
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the unsymmetric vacuum. This identity is important because we already know 
how to renormalize the symmetric theory. From Sec. 12-4-3, 


l's.r(¢, 4, 7,.-.59,46 w)=T, reg(Z2 ZV7A, DP? N,-. & Qemauvgitl table) 
(12-219) 


Consequently, the symmetric counterterms will also make the broken theory 
finite: 


Pr(d, C, V, A, Bienes 1) = r,,(22'76, Co, Vo; ZA, ee we o bu’) (12-220) 


provided v and ¢ are renormalized so as to maintain the finiteness of c- @ and 
the validity of (12-218): 


Co=Zite 8 v9 = Ziv (12-221) 


The second point concerns the variation of u*. Power counting tells us that 
in the unbroken theory such a variation only requires a modification of the 
counterterm du*. The identity (12-217) shows that this modification is also 
sufficient to make the broken theory finite. In this case, we might think that a 
variation of py? also requires a modification of the mass term M4 of the vector 
field, which would invalidate our proof. However, Eq. (12-217) says that this 
modification of M4 comes only from the variation of v as a function of py’. 

In conclusion we may reach any point of the (m’, v) plane (Fig. 12-12) and 
renormalize the corresponding theory with the symmetric counterterms. Modifying 
only the yu? counterterm, we renormalize the spontaneously broken theory 
[ec = 0, m? = 0, v given (point 6 on Fig. 12-12)]. See also Fig. 11-14 for plots of the 
(u?, m”) or (7, v) planes. 

The corresponding normalization conditions of the various proper functions 
have not been explicited. They can be deduced from the identity (12-217) and 
from the normalization conditions of the symmetric functions. Instead of this 
intermediate renormalization, we may prefer more physical conditions, such as 
those defining the coupling constant as the value of the three-point function at 
some on-shell point, etc. Needless to say, these new normalization conditions 
must be in agreement with the identities derived from (12-217) and from those 
satisfied by T's. 


Figure 12-12 Curves of constant py? and c in the 
(m?, v) plane. 
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The method followed here is economical, since it appeals to the simpler symmetric theory in order 
to renormalize the spontaneously broken one. It is, however, possible to avoid any reference to the 
massless unbroken case. The problem then amounts to show that the Slavnov-Taylor identities 
satisfied by the spontaneously broken theory may be preserved by renormalization. 

The previous analysis has been carried out in the renormalizable gauge where the comparison 
between broken and unbroken theories is obvious. This gauge is not physically satisfactory since 
it exhibits unphysical features such as the massless modes of ¢7 (m? +0 when c > 0). However, the 
study of renormalization in the unitary gauge such as the one in Eq. (12-209) would be much more 
difficult, as the theory looks nonrenormalizable and contact with the symmetric case has been lost. 


12-5-5 Gauge Independence and Unitarity of the S Matrix 


We want to show that all unphysical states—fictitious Goldstone bosons, 
additional polarization states of the vector field, and Faddeev-Popov fields—do 
not actually contribute to S-matrix elements. All these unphysical fields have 
propagators with a pole at k* = 0. It is tantamount to showing that this singu- 
larity does not contribute to intermediate states. 

A simple proof uses the gauge independence of the S matrix to introduce the 
*t Hooft gauge 


F, = ee _ = ( r6)| (12-222) 


We assume that the group is simple and hence has a single coupling constant, 
that the scalar field @ belongs to a real representation, and that ¢’ is obtained 
after a translation ¢ = ¢’ + v, <d’> = 0. This gauge has several merits and its 
invention by ‘t Hooft was a major step in the theoretical developments. It breaks 
explicitly the global invariance. Consequently, the would-be Goldstone bosons 
acquire a mass matrix 


2 
—4(m’ 4)ap$'ab's = — oi (ba T apve)(', T*, avs) (12-223) 
Moreover, the Faddeev-Popov ghost also acquires a mass. The operator My reads 
23 
Mx{X, y) = \2.D ~ = T°T'(h' + on| 6*(x — y) (12-224) 


since it results from an infinitesimal gauge transformation in % acting on both 
Aand ¢’. The ghost-mass matrix is therefore 
2 


(m2) a» = (Tv, T*v) (12-225) 


Finally, this choice diagonalizes the quadratic form in A and ¢’. The crossed term 
in the expansion of —(A/2)F? just cancels —g(0,¢’, A",T°v) arising from 
4(D,,, D*¢). It follows that in terms of the mass matrix of Eq. (12-210), the 
vector propagator reads 
aa keke 
Wk) = Z Ci a 12-226 
w= peel" * oe | ( 
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As 24-00, one recovers the Feynman rules in the transverse (Landau) gauge, 
while as A > 0, all the unphysical masses recede to infinity. In the latter case, we 
do not expect these states with enormous masses to contribute to the S matrix. 
We proved in Sec. 12-4-4 that the § matrix does not depend on the choice of 
gauge. The argument which was formal due to the infrared divergences is now 
justified. We conclude that in any gauge, and in particular in the Landau gauge, 
the unphysical states do not contribute. A careful analysis should pay proper 
attention to renormalization. On this point the reader is referred to the literature. 


Even though unphysical particles have disappeared from the physical subspace. there remains some 
trace of the spontaneous breaking mechanism, namely, the (massive) components of the scalar fields. 
Besides these scalar Higgs fields, we recall that some components of the vector field may remain 
massless. 

We may wonder whether it is mandatory to introduce scalar fields and whether it is not possible 
to generate them as bound states, for instance, of a fermion-antifermion pair. Such a dynamical 
breakdown is illustrated by the Schwinger two-dimensional massless electrodynamics. The vacuum 
polarization has a pole at zero momentum, the fermions disappear from the theory, and the only 
remaining single particle state is a bosonic bound state of mass e , 7 In spite of the appeal of such 
a mechanism, it is not presently known how to realize it in four dimensions. 


12-6 THE WEINBERG-SALAM MODEL 


We present a realistic unified model of weak and electromagnetic interactions 
proposed independently by Weinberg and Salam and based on a spontaneously 
broken gauge theory. Among all the models of this type, it may be singled out 
because of its anteriority, its economical number of parameters, and the fact that 
it has received some experimental confirmation with the discovery of neutral 
currents and of charmed particles. 


12-6-1 The Model for Leptons 


The electron and its neutrino v, are treated on the same footing as the muon and 
its neutrino v,. The left helicity component of the charged lepton e; = (1 — 75)e/2 
[uz = (1 — ys)u/2] and its neutrino v,(v,) are grouped into a column matrix 


Ve Vp 
L.= (‘:) Gas a (12-227) 


This suggests the introduction of a group of leptonic isospin for which L, and L, 
are doublets, while the right components er = (1 + y5)e/2 = R, and ur= R, are 
singlets. A leptonic hypercharge Y is also assigned to each of these fields in such 
a way that the analog of the Gell-Mann and Nishijima rule is satisfied : 


i 
Q=T?+ 2 (12-298) 


The left doublets have Y = — 1 and the right singlets Y = —2. The weak isospin 
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T and hypercharge Y commute; therefore the transformation group is 
SU(2) x U(1). 

We then construct a gauge theory with this invariance group, involving a 
triplet of gauge fields A, for SU(2) with a charge g and a field B, for U(1). The 
U(1) coupling constant will be denoted g’/2. Since we want a single gauge field 
(the photon) to remain massless after spontaneous breaking, we introduce a 


doublet of complex scalar fields: 
+ 
ie i ) (12-229) 
Po 


of hypercharge Y = +1. The most general renormalizable invariant potential for 
d is 
V(p'd) = wh" + ACHP)? (12-230) 


For p? <0, ¢@ acquires a nonvanishing vacuum expectation value, which may 
be assumed real, along ¢°: 


1 /0 : pu 
ee =-—-—+0 12-231 


D 


The symmetry SU (2) x U(1)yis broken but the symmetry under U(1)g 1s preserved. 
This achieves the desired result, since one vector field coupled to the electric 
charge remains massless. 

The lagrangian reads 


Lf = —}fA,,A” — 4B,,BYY + | Rate —gB)R. + L, (i eof Bg 4.) inp 
ae G(L-Red aa 'R.L.) te eon | 


t ° : + fi i : 
is (0.0 ~iLB,ge-2 And) (08 Ses 2 ang — V($"¢) 
(12-232) 
where A,, and B,, stand for the field strength tensors 
A, = 0,Ay— OA, +gA, XA, Buy = 0,By — 0B, 


and t! (i = 1,2, 3) are the Pauli matrices. The SU(2) symmetry prevents us from 
writing mass terms of the electron and muon, but it does not forbid the intro- 
duction of the coupling to the scalar field with coupling constants G. and Gu. 
In order to understand the physical content of this model, let us use a unitary 
gauge. We use the parametrization 
0 


: i (x) 
(x) = etenei2ny YAP) (12-233) 
Wp 
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and perform the SU(2) gauge transformation 


@) ’ 
ee + 2D 
$(x) > $'(x) = Wp | 


© Aux) + A glo) = 8 (: a, + 
5) pi 2 hi g u 


el! —remee 


B, R invariant 


Equation (12-232) is expressed as 
— : tT! : 
L = —4A,,A”” — 4B,,B* + Ec —g Bengt Le (ie — s Bg a a) Le 


Vane 
9, 
(v + py 


#04 FLW B, 043) + gba" + aay — V] ; | (12-235) 


=) 


(2rer + @1er) + € ou | + 40,p0"p 


The scalar (Higgs) field p has a mass ,/ —2u?. The electron and the muon have 


acquired masses equal to m, = Gev/./2 and tay Gyv//2. The charged vector 
field 


il 
Wy; = 7 Ae + iA?) (12-236) 
is also massive, with a 
My = BE (12-237) 


Finally, the quadratic form in A* and B is diagonalized by 

Zy= (9° + 9)" "(—gAp + g'B,) 
5 Beate (12-238) 
A, = (9° + 97)" *7(gB, + g' Ap) 


so that » 
Mz=-=(g? + g’)'? 
2 (12-239) 
M,sz=0 
The leptonic interaction terms of “ may then be rewritten in terms of the 
physical fields W*, Z, and A: 


itt Ss g 1) —_ ms e v, S 
fen = 37 [vey"(1 — ys)eW," + he] — [tan Ow(2eey"en + Vey"ve + exy"er) 


— cot Ow(éxy"ex — Vey"ve)|Z, — eA,ey"e + {e<op, ve v,} (12-240) 
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~ Eq. (12-240) we have introduced the Weinberg angle Oy, defined in such a way 
that 


i , 


oe tan Oy = 2 


V9 +9" g (12-241) 
or e=g' cos Ow = g sin Ow 


The last term in (12-240) is the usual electromagnetic coupling to the field A,,. 
The first one has the form (12-176). Its coupling is related to Fermi’s constant 
through 


Ce ge? _ 
> SNe 2r2 


(12-242) 
aN 
From the knowledge of G. we deduce lower bounds on the masses of W* and Z: 


3 
epee Gey = 48 Gev 
2 ~=sin 6 
Mz= eh tl —— GeV > 76 GeV at) 
2" 2 cos Oy ~—ssin sin 20y 
My < Mz 


The coupling constants G, and G, are determined from the electron and muon 
masses: 


G.= ~- = 294GN2m,~ 2 x 10-§ 
(12-244) 
G,=—*G,~4x 10-4 
es 
This model involves a new type of coupling of Z, to a neutral, parity violating 
weak current constructed from e;, ér, and v, (and juz, r, V,). This is a feature of most 
renormalizable models of weak interactions. They introduce either a neutral 
current and a vector field coupled to it, or new leptons assumed to be heavy 
to comply with experimental facts, or both. In the former case, the explicit form 
of the current depends on the model, viz on the choice of representations for the 
various fields, etc. The Weinberg-Salam model incorporates in a natural way the 
electron-muon universality. Only G, and G, are sensitive to the nature of the 
lepton. On the other hand, the model does not provide any natural explanation 
for the electric charge quantization. 


This is not the case for other models based on simple groups such as the Georgi-Glashow model 
(Sec. 12-5-3). As a model for weak and electromagnetic interactions, it is now ruled out by experiment 
since it does not incorporate neutral currents. Among the three components of the gauge field, two 
become massive (the analogs of W,* in the Salam-Weinberg model) while the last one remains 
massless (the photon). The benefit of dealing with a simple group is that the electric charge is 


quantized. 
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12-6-2 Electron-Neutrino Cross Sections 


To expose some striking consequences of the existence of neutral leptonic currents, we will now 
compute the elastic e — v cross sections to lowest order in the Weinberg-Salam model. The relevant 
diagrams are depicted in Fig. 12-13. In the limit where the incident neutrino energy is small as 
compared to the masses of the W and Z bosons, we may content ourselves with the effective 
lagrangian 


2 pe int = —/2G{[iey"(1 = ysvel[ey,A1 = ys)ve] 
+ (Tyy"vy + Fep"ve)(2 sin? Oweryuer — COS 20weLynex)} (12-245) 


where the first and the second terms represent the W and Z contributions respectively. After a Fierz 
transformation on the first term 


i] = = 1-—} 1-}>y 
(sa se e)(e- atte v) - (a = -)( na) (12-246) 


forint = = -Ji64(var 


“ (>. ; sb Ps ; YS 49 sin? one (12-247) 


A general expression of the form 


ies fam ee 
Loa= -Jio(m “ - »)fera(Ce Lee oP rye | (12-248) 


where C, and Crare real coefficients, leads to a cross section 


do 
dE’, 


this interaction is 


G? 
= Sum Ez LCE a)? + Chala’? — CrCeme(q- 4)" (12-249) 


for the process e(p) + v(q) > e(p’) + v(q’). The expression for the process ev > ev is obtained by inter- 
changing C, and Cr. When the incident neutrino energy is much larger than the electron mass, we 
have 


(p-q)* = m2 E} 
E’.\? 
(p:q')’ = mE; (.- ) 


[Be 
E'.m 
~m = mE — 
qe pop’ = E. E, 
viv viv Ve e 
v, v, 
Z Ww =a 
e e 
e e Fe F 


fe 


Figure 12-13 Electron-neutrino scattering to lowest order. 
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and the last term in (12-249) may be neglected. We finally obtain in the various channels 


. ts GME, . ; 
o(¥,e + ¥,e) = = [4 sin* Oy + $(1 + 2 sin? 6y)7] 
G*m,E, : 
a(v.e — v,e) = ae [(1 + 2 sin? Oy)? + $sin* Ow] 
T 
(12-250) 
a Gann, aah : 
o(i,e > v,e) = —— —[4sin* Oy + 4(1 — 2 sin? Oy)7] 
G?m.E, oe. oe 
o(v4e > v,e) = ———— [(1 — 2 sin? Oy)? + $ sin* Oy] 
PAs 


These expressions are compatible with the few observed v,e and v,e events and with the value of 
6yw derived from neutrino-nucleon inclusive reactions 


sin? Oy ~ 0.25 + 0.02 (12-251) 


Experimental investigation of the structure of neutral currents is still actively underway, in 
particular in atomic physics. 


12-6-3 Higher-Order Corrections 


A renormalizable theory of weak interactions enables us to compute higher-order 
corrections. In fact, in the model presented so far, only leptonic processes can be 
considered, such as the weak contribution to the muon anomalous magnetic 
moment or the radiative correction to the muon decay. 

The weak contributions to g, — 2 are shown in Fig. 12-14. For this one-loop 
computation, it is safer to use a renormalizable gauge of the form (12-232). The 
diagrams of Fig. 12-14a, b,c give contributions of the form 


d*k = O(km,, m2) 
ot ete Gee 


: d*k O(km,, m2) 
(b) g? | oe Ge mp (12-252) 


mug? [ d*k Oki, mi) 
Mav J k* — ms, [(p — k)? — mi]? 


(c) 


Zz 6° 


(2) (6) a 


Figure 12-14 Weak corrections to the muon anomalous magnetic moment. 
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In cases (a) and (b), M stands for the mass of the W, of the Z, or of one of the 
would-be Goldstone bosons @*. Assume that we have chosen a gauge where the 
latter is very large, of the order of My such as given by Eq. (12-223) with 4 finite. 
The one corresponding to the combination (¢o — b)/./2 is also large, while the 
physical component (do + biy//2 has an unknown mass my,. We assume, 
however, that my, is much larger than m,. In the contribution (c), we have taken 
the muon-scalar coupling 


G,= J2m, = > | (12-253) 
oa Me 


into account. Since the F form factor does not require any ultraviolet subtraction, 
we expect the previous integrals to behave as m2/M? or mz/mj, times a possible 
logarithmic factor. The weak corrections to the anomalous magnetic moment are 
thus at most. of order 


2 Mi 2 -7 
g ve Gm; ~ 10 (12-254) 


This is precisely the order of magnitude of both the experimental and theoretical 
uncertainties (the latter due to the hadronic contributions): 


~ 


g—2 


= (11 659.22 +009) x 1077 
exp (12-255) 
1-2 = (11 659.19 + 0.10) x 1077 


th 


A logarithmic factor In (m2/Mj) might, however, make the weak contributions 
sizable. Notice that the contribution of the Higgs boson is suppressed by a factor 
m;,/Miy with respect to the two others, and is negligible. In the actual computation 
no logarithm appears, and the weak correction is small: 


g—2 
2 


Gm; 

9 4: c 
= ag sin* Oy — 4 sin? Oy + 4) 
weak J2n 


210° (12-256) 


A similar calculation may be carried out for the electron. The experimental and 
theoretical accuracies are higher (~ 10~ ’), but the weak contribution is suppressed 
by an extra factor m2/m7, ~ 107°. 

Radiative corrections to muon decay may also be obtained. The result ig, OF 
course, finite when expressed in terms of renormalized quantities, and amounts 
to a renormalization of the Fermi coupling constant G. This correction of order 
a might be observed in comparison with the coupling in a different weak process 
such as f decay. Unfortunately, there is no other purely leptonic decay, and a 
comparison with a hadronic system involves strong interaction corrections. 
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12-6-4 Incorporation of Hadrons 


A natural way to incorporate hadrons in the previous scheme is to couple quark 
multiplets to the gauge fields. The structure of the charged hadronic current in 
terms of the conventional quarks u, d, s has already been given (Sec. 11-3). It 
contains strangeness conserving and strangeness changing components, with a 
Cabibbo mixing angle. This angle could perhaps be predicted in the framework 
of a gauge theory of weak interactions. In addition to this charged current, a 
neutral current coupled to the Z boson emerges in theories like the Weinberg- 
Salam model. The constraint that strangeness changing processes induced by this 
neutral current should not be in violent contradiction with experiment turns out 
to be a stringent demand, and requires the introduction of at least a new quark. 

We start with the three quarks u, d, s and neglect for the time being their 
couplings to the Higgs scalars. The Cabibbo angle @, is assumed to be given. The 
usual charged hadronic current will be reproduced if we give the following 
assignments to the quark components: 


N= ( 2 ) | (12-257) 


dg =dcos 0,+s sin 6, 


while UR; dr, SR; and Sen 
—3, —%, —4. The interaction 


is a doublet of weak isospin, with Y= 
(s cos @, — d sin @,), are isosinglets, with Y = 


Lint = N, (oa 5 So : - 2) Ni + Ee Z B)sa + (sop > dr) + (Sor > sa | 
+ $irg’ Bur (12-258) 


may be rewritten in terms of the fields W, A, and Z of Eqs. (12-236) and (12-238) 
as 


me ‘ ‘em 
Bint — me == hc) =F 2 a = ae aE ag zt (WS .m — sin? Owjn | 


1 
3; 
4 
3, 


~ 


(12-259) 
with h, = uy,(1 — ys)(d cos 0, + s sin 8.) 


em __ 


ay ly ,u == $(dy,d ag SyuS) 
as expected. The neutral boson Z is coupled to the electromagnetic current and 
to 
NitayaNz = aryus — Cos” O-dry,di — sin 8, cos O(dryySt + StYuds) 


— sin? O,51,VeS. 


The terms proportional to sin 0, cos @, are the strangeness changing neutral 

currents. They are embarrassing because experimentally AS=1 or AS = 2, 

AQ = 0 processes are heavily suppressed. For instance, 
[(K* > x* vv) T(K° opt) 


—=6x1077 


IMS TEE eee EY 107® ~~ (12-260) 
T(K# = all) T(K° — all) 
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This puzzle was solved by Glashow, Iliopoulos, and Maiani (1970) through the 
introduction of a fourth quark, denoted c, of charge 3 like u and carrying a new 
quantum number, the charm. The left-handed component cy 1s supposed to form 
an isodoublet, together with the combination se, = —dr sin 6. + sz cos @,. In 
other words, there are now two left-handed doublets 


(*) (<) ee (12-261) 
do/t \S0/L 


and four right-handed singlets upg, de, Sr, and cr with Y= 4, —4, —4, 3 


respectively. 
It is easy to see that the neutral boson is now coupled to 


Z dg 


Strangeness changing neutral currents occur in the first and in the second terms 
of (12-262) but cancel exactly. This means that the unwanted currents have been 
eliminated to order G. However, strangeness changing neutral transitions induced 
by higher-order exchanges of charged currents might still endanger the theory. 
This is the case of the diagrams drawn in Fig. 12-15, expected to be of order Ga 
and hence in disagreement with the very low experimental rates (12-260). 

However, after introduction of the c quark, the form of the charged current 
becomes 


= eee (5 
(, ine (2) + (G55 tu (“) —ygin? Ope” (12-262) 
L Are 


hy = uy,(1 — ys)de + Cy,(1 — ys)S0 (12-263) 


When ail contributions are taken into account, it turns out that the dangerous 
amplitude is proportional to (mz? — mz)/My. For charmed quarks much lighter 
than the W meson, that is, m, < 1.5 — 2 GeV, there remains no discrepancy with 
experiment. 

Besides this property of suppressing the AS + 0 neutral currents, the charmed 
quark has also an aesthetical appeal. It restores a symmetry between the four 
leptons and the four quarks, and, as a bonus, it removes the anomalies of the 
Weinberg-Salam model. Indeed, the lepton model studied in the previous sections 
is plagued with chiral anomalies. If we apply the analysis of Sec. 12-4 with only 
weak isosinglets and doublets, we readily discover that the only nonvanishing 
anomaly is proportional to 

Det (1° FF jiec ae 


fermions doublets 


—cos 6, 
s 5 7 
K, ar c Vy 
a 3 : e 
cos 6, sin 0. 


Figure 12-15 K° > y* uw” decay. 
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As above, T* is some component of the weak isospin and Y denotes the weak 
hypercharge. This corresponds, for instance, to triangle A“A“B diagrams. It follows 
that leptons give a nonvanishing contribution to the anomaly. With the two 
doublets L, and L, of (12-227), we have 


y Y=2x(-1) (12-264) 


leptonic 

doublets 
However, if we incorporate hadrons according to the previous scheme, the two 
doublets of (12-261) contribute 


Y Y=2x4 (12-265) 
hadronic 
doublets 


If, moreover, we assume ~-as in the discussion of the 2° decay in Chap. 11—that 
quarks come in three degenerate unobservable colors, the expression in (12-265) 
must be supplemented by a summation over color, i.e., multiplied by a factor 3. 
Leptonic and hadronic contributions to the anomaly thus cancel. We emphasize 
the importance of the equal number of leptonic and hadronic doublets—for the 
previous weak isospin and charge assignments—and of the color degeneracy factor 
for the validity of this argument. 

The computation of the 2° decay is not affected by the present cancellation 
mechanism. Indeed, recall that x° is coupled to the divergence of the third 
component of the ordinary axial isocurrent, that is, (i, d)y,ys573/2(4). 

We have omitted so far the Yukawa-like couplings of the scalar mesons to 
quarks and the quark-mass terms. Before spontaneous breakdown takes place, the 
theory must be SU(2) x U(1) invariant, which forbids quark-mass terms. However, 
after spontaneous breaking and shift of the Higgs field, the quarks acquire a mass 


yaa 


@ 7: =" My —y—— Mt (12-266) 
Here ./ is a matrix which is a priori neither diagonal nor real, and is just restricted 
to commute with the charge operator Q. By independent unitary redefinitions of 
the left- and right-handed components, it is possible to diagonalize it: 


Wi = UL 
W'r= UrWr (12-267) 
UpMU] = M diagonal 
Be ee ayia hts a5 uty (12-268) 


The eigenstates y’ of the mass matrix are the quarks u, d, s, c. On the other hand, 
the charged current which reads 


hy x Wy,(1 — ys)t- (12-269) 


630 QUANTUM FIELD THEORY 


becomes = 
h, oc W'y,Al — ys)U{LU aL’ (12-270) 


where U,, and U>, are the restrictions of U, to the upper (u, c) and lower (d,s) 
components respectively. By a further redefinition of the relative phases between 
the quarks [of course, unobservable on (12-268)] this may be cast into the form 


in waite We 
hy = (@2)7,(1 — ma —— Nf (12-271) 


—sin@, cos @,/\s 


in agreement with (12-263). We conclude that the Cabibbo angle comes from the 
mismatch between the eigenstates of the mass matrix and the quark components 
entering the charged current. 


A presentation of models of weak interactions should include a review of their implications —on 
neutrino scattering off hadrons, in particular. It is wiser to refer the reader to more competent 
authors for a thorough discussion. 

In view of the recent experimental discoveries, the preceding theoretical framework may and 
must be extended to include more quarks and more leptons. We content ourselves with a simple 
remark. The Cabibbo mixing matrix which took the simple form (12-271) in the case of four quarks 
may depend on more parameters, some of which may be complex, thus introducing CP violations. 
The recent years have seen a blossoming of theoretical models, involving various groups, multiplet 
assignments, possible right-handed couplings, etc. Any discussion is doomed to become obsolete very 
soon. 

How would strong interactions enter this scheme? For reasons to be discussed in the next 
chapter, a nonabelian gauge theory of strong interactions nowadays seems to be a good candidate. 
The gauge group would be a SU(3). group, unrelated to the Gell-Mann and Ne’eman octet 
symmetry. The eight-gauge fields—the so-called ghuons—would be coupled to the color quantum 
numbers of the quark triplets. In contrast with the spontaneously broken symmetry of weak and 
electromagnetic interactions, this local SU(3). symmetry would be exactly implemented and the gluon 
would remain massless. In this gauge description of weak, electromagnetic, and strong interactions, 
quarks carry the two quantum numbers of color and flavor. On the other hand, vector bosons 
W, Z,..., or the photon are colorless, while the gluons have neither flavor nor charge. 

Finally, it is possible to speculate that the group Gw x SU(3). [Gw = SU(2) x U(1) for the 
Weinberg-Salam model] originates from the breaking of a larger simple group. Such a superunification 
might even extend to gravitation. 


NOTES 
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on a spontaneously broken gauge model. This phenomenon had been discussed 
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Boston, Mass., 1976. 
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by R. Balian and J. Zinn-Justin, North-Holland, Amsterdam, 1976; and W. 
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monopoles were found independently by G. ’t Hooft, Nucl. Phys., ser. B, vol. 79, 
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vol. 139, p. 1, 1978. 

For the muon anomaly, see Chap. 7. The weak corrections have been com- 
puted by K. Fujikawa, B. W. Lee, and A. I. Sanda, Phys. Rev., ser. D, vol. 6, p. 
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CHAPTER 


THIRTEEN 
ASYMPTOTIC BEHAVIOR 


Relations between large-momentum or short-distance behavior and renormaliz- 
ability properties have been discussed in the early 1950s by Stueckelberg and 
Peterman and by Gell-Mann and Low. In the 1970s, under the lead of Wilson, 
Symanzik, and Callan, this subject blossomed. We should not underestimate the 
flow of new ideas which resulted in the confluence with the study of critical 
phenomena, to which are associated the names of Kadanoff, Fisher, and Wilson. 
The troublesome short-distance singularities, which forced upon us the great 
machinery of renormalization, turn out to be the key to scaling properties of 
operators. A set of differential equations may be written expressing the nontrivial 
effect of an infinitesimal change in the scale. Upon integration they reveal that the 
short-distance behavior acquires, in favorable cases, a universal character, with 
fields and composite operators being assigned anomalous dimensions. 

These ideas find successful applications to deep inelastic lepton scattering, 
electron-positron annihilation, and other high-energy processes. The concepts of 
short-distance expansion, asymptotic freedom, enrich the theorist’s arsenal, and 
motivate hopes of developing a fundamental theory of strong interactions. 

The parallel development in statistical mechanics has been tremendous, and 
has achieved notable successes in the comparison with experimental facts. 

This chapter can only be an introduction to this vast subject. We shall avoid 
intricate mathematical proofs and rely mostly on heuristic arguments and 
examples, following the historical development at the price of some repetition. 
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13-1 EFFECTIVE CHARGE IN ELECTRODYNAMICS 


To introduce some of the ideas we begin with the case of electrodynamics 
originally studied by Gell-Mann and Low. 

The fundamental quantity is the electric charge or rather the fine-structure 
constant « = 1/137 measured in low-energy experiments, in atomic physics, say, 
1.e., over distances much larger than the charged fermion Compton wavelength 
(for short we refer to electrons) which fixes the fundamental scale. In high-energy 
experiments we are rather interested in local, quasi-instantaneous properties of the 
interaction. We would expect naively that this regime is dictated by the bare 
parameters occurring in the hamiltonian. Unfortunately, as a result of renormaliza- 
tion the relation between bare and renormalized charges is plagued by infinities, 
at least perturbatively. The manner in which these infinities have to compensate 
imposes constraints which are reflected in the asymptotic behavior. The meaning 
of the word asymptotic will be clarified as we proceed. 


13-1-1 The Gell-Mann and Low Function 


Consider the photon’s propagator and its relation with vacuum polarization 


Oe 3: 

Gpv(q) t Fak Eu a(q?)] 7 Gv(q) (1 1) 
For simplicity no fictitious photon mass has been introduced. The longitudinal 
part Gi, in q,q, plays no role in the following. Normalization of charge is insured 
by the condition (0) = 0, and is therefore fitted to low-energy interaction or 
soft photon emission. The vacuum polarization is a function of q?, a, and m 
the electron mass. We define the effective charge at momentum gq, d(a, q’,m’), 
by considering the combination xG,, associated to photon propagation 


aG (gq) = —i , & d(x, q2,m?) + aG%,(q) 
Mn (13-2) 
1 + @(a, q?, m7) 
d(a, 0, m7) = a 


Subtractions are usually carried out at q? = 0. Consider the effect of picking a 
different subtraction point q* = 1? < 0 (in the euclidean region away from singu- 
larities). The perturbation series will be defined in terms of a parameter a, equal to 


a, = d(a, A*, m?) (13-3) 

In terms of «, the effective charge will be a function D such that 
D(a, q?, m?, A”) = d(a, q?, m*) (13-4) 
and og = D(a, A*, m?, A?) (13-5) 
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Equation (13-4) expresses the apparently empty fact that the physical content is 
not modified by a mere change in the conventions. A different subtraction point 
leads to a modification of «, preserving the function D. The discussion is simplified 
here by the Ward identities of electrodynamics Z,; = Z2, 4 = Z3%o, implying a 
consistent treatment by referring only to the vacuum polarization. 

Since the D function is dimensionless we may write 


(13-6) 


and exploit these relations in the deep euclidean region where — q?/m* becomes 
large. We know in principle the perturbative expansion of the vacuum polarization 
(ad)~' — 1. Order by order we extract its asymptotic behavior by neglecting terms 
behaving as (m/q)[In (—q?/m)]". This defines a function d*(x, —q?/m?) which 
would also result from massless quantum electrodynamics with m? as a scale 
parameter. We therefore explore the domain —g?/m? > x, xIn(—g?/m’)>0, 
hoping that neglected terms do not sum up to a nonnegligible contribution. 
Such an hypothesis cannot be seriously analyzed at this stage. 

The right-hand side of Eq. (13-6) becomes d“(x, x) with x = —q?/m?. If we 
choose the subtraction point such that m? « —/? < —q*, we can neglect the 
dependence on m? of the left-hand side, since we know from Chap. 8 that the 
corresponding massless limit exists. In the limit, D is replaced by D® with A? 


the scale parameter. Setting y = — A*/m? we find 
be 
D* (>. *) = a) (13-7) 
with dy = d(x, y) = D* (ay, 1) (13-8) 


These relations imply that d“ may be considered as a function of a single variable. 
To see this we define, following Gell-Mann and Low, the function 


i) 
V2) = = D*(z, [za (13-9) 


differentiate Eq. (13-7) with respect to x, and set y = x. It follows that 
. 6 
Wax) = x By 4% x) (13-10) 


Using Eq. (13-8) this may be integrated in the form 


sa d*(c, x2) dz 


ae 7 lo (2) oa 
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equivalent to the series 


aa X2) = 2 a (in = [ve mala 


Before giving a general discussion let us recall our earlier results (8-130) for the 
vacuum polarization. Remember that the power of « is given by the number of 
loops. To order «’, 


(13-12) 


z=d%(a,x)) 


31 3 4n? 


with C, a numerical constant. If «, stands for 


Oo eei= — Ss (1 x- ;) ee (In x — C2) + O(a) (13-13) 


a, = d(a, 1)= (13-14) 


oo 
1 + 5a/9n + Co(a?/4n?) + °° 
we have 
Oy . 
1 — (a/32) In x — (03/427) Inx+-° 


To obtain the function (x,), we compute the derivative (6/0x)d*(a1, x) for x 
equal to one. Let us also include the result of the three-loop calculation per- 
formed by Baker and Johnson. This gives 


a™(a, x) = (13-15) 


2 


re z ya 101 : 
W(z) = 3 ate ae tr an Ee =z 2 | O(z°) (13-16) 


where ¢ is the Riemann function 
E(ayi=— 1202e:. (13-17) 


The solution of the Gell-Mann and Low equation (13-10) shows that it is 
possible to obtain the dominant behavior of the vacuum polarization to order n, 
knowing d* and therefore y to lower orders. Let us illustrate this remark by 
restricting y to the two-loop value (z) = 27/3x + z3/4n2 + --- and deriving the 
dominant «(In x)? term in d. From Eq. (13-11), 


das(a, x) dz - ) 
inx= | me (a, x)) — f (a1) 
(13-18) 


3 
f(z) = constant — = = : in z + O(2) 


Upon inversion this yields 
a 
1 — (,/37) In x — (a3/4n7) In x — (a3/2473) (In x)? + O(a} In x) 
5 


en ee) -- (13-19 
1 1 + 50/90 + Cr07/4n? + On” oy 


d*(a, x) = 
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Consequently, we have indeed found the dominant behavior to order a>. More 
modestly, with only the first term in y we would have found the cancellation of 
a?(In x)? terms in the two-loop computation of @, a result which cost us con- 
siderable effort in Chap. 8. Note, however, that it would have been insufficient to 
obtain the coefficient of a In x. 

If a, is substituted as a function of x in (13-19), we recover nonleading 
contributions to a given order. At any rate, the conclusion to be drawn from 
(13-19) is that as — q? goes to infinity the three-loop contribution to w behaves 
as —(a3/24n°)[In(—gq?/m?)]?. The renormalization group therefore allows us to 
derive new results, while it seemed at first that we were only stating trivialities. 
We will show below that the numerical expressions obtained so far are sufficient 
to predict all terms of the form «"[In (—q?/m7)]""*. 

The function = is only known as a power series around the origin. We have 
no guarantee about its convergence. On the contrary we suspect such a series to 
be at best asymptotic (Sec. 9-4). We recall that its range of validity requires 
a In (— q?/m?) > 0 when — q?/m? > oo. This explains why the existence of a pole 
in d in the vicinity of the unphysical euclidean point 


—¢ at m2 eine ~. m2 x 10°°° 


is rather doubtful. Such a pole, sometimes referred to as the Landau ghost, occurs 
when keeping only the first-order contribution in @. Since this singularity is 
clearly in the asymptotic region, a serious investigation would require the com- 
plete expression of d%, all of its terms being of comparable size. It is therefore 
hazardous to draw any conclusions on the inconsistency of quantum electro- 
dynamics at this stage. 

On the other hand, assuming wy to be meaningful, it is possible to speculate 
on the theory as a whole by formulating precise hypotheses on this function. 
It has a universal character independent of normalization conventions as a con- 
sequence of its very definition. 

Let us study a few possibilities. As suggested by the first terms in its expan- 
sion, assume that w(z) is positive in an interval 0 < z<«,. To be consistent 
we suppose that a, is included in this interval. 

If «. is infinite, i.e., if y(z) only vanishes at the origin, it is necessary that 
the integral J* dz/y(z) diverges at its upper limit in such a way that d*(q, x) 
goes to infinity as x - oo. Otherwise d*(a, x) becomes infinite for a nonphysical 
value q? = —m? exp [JP dz/(z)] and the same is likely to be true for the complete 
function d. In other words, we have a genuine Landau ghost. Clearly the first few 
terms in y cannot give a proper indication to test this hypothesis. 

The second possibility is that W(z) has a finite positive zero w,,. For 0 < a 
<%» the positivity of y(z) implies that d*(x) is an increasing function and we 
require |** dz/i(z) to be divergent. Such is the case if W(z) has a simple zero. 
Under these conditions d“(a, x) tends to «, when x— oo. In fact, since w(z) 
decreases in the vicinity of this point, d* tends to a, no matter whether a, 
is smaller or larger than «,,. This property is characterized by saying that o, is 
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an ultraviolet attractor (or stable fixed point). In the case «, = «,, d* would 
reduce to the constant «,,. 


In a more quantitative way let us write, close to z = a4, 


W(z) = v(a.. — z)+°"° v>0 (13-20) 
Integration of (13-11) yields 


2 4 WY) 


where the positive constant k depends on the precise form of y. The critical index 
v[v = —(dw/dz)(x..) > 0] characterizes the approach to the fixed value «,, for 
which the photon propagator assumes its free-field value. Correspondingly, «,, 
would describe the electromagnetic coupling at short distances as if the bare 
coupling were finite. 


13-1-2 The Callan-Symanzik Equation 


Instead of comparing the effect of changing the renormalization point, we make 
seemingly a step backward by returning to the bare regularized Green functions 
using a cutoff A >m. For an appropriate choice of bare parameters we have the 
relation 


d(a, q?,m?) = lim d(a, q’, mg, A”) (13-22) 
A> oo 


The relation as we know was only established perturbatively and does not 
involve any multiplicative wave-function factor due to the Ward identity. Further- 
more, we have 


A2 
oO, Z3 (+. a) (13-23) 
m 


The A > co limit will always be understood, but most of the time it is not written 
explicitly. The crucial observation is that, due to this limiting procedure, the 
renormalized amplitudes depend on one dimensional parameter less than the 
regularized ones. 

To proceed, consider the irreducible function d~ 1 — ~ '(1 +a). The diagrams 
contributing to 


é 
dt) Zee dé *(X0, q. mo, A’) 


Omo 


are superficially convergent. Except for the factor g"’q? — q’q* they correspond 


to the irreducible part of the Green function (Fig. 13-1) 


[as e” 4°* Ol Tj*(x)j*(0) [ety imo: WoWoly): |O> 
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Figure 13-1 Diagrams for mo(0/dmo)d7’. 


The superficial convergence assumes current conservation. The insertion of the 
operator imo J d*y: wy): increases by one unit the degree of one of the denomin- 
ators in the Feynman integrals. However, as discussed in Chap. 8, the subtractions 
of internal divergences due to the insertions of moWoWo in fermion self-energy 
subdiagrams require the introduction of a new counterterm or, equivalently, the 
multiplication of Woo by Z;,- Ultimately 


2 0 
A («. _ 2) = Zz ,m =— d- *(ao,q°, ma, A°) (13-24) 
m CMo 


will be finite in the limit A — oo. To lowest order A is derived from formula 
(7-9). It behaves as —(m?/q*)In(—q?/m’) for large —q?/m?. At order n, A is 
—m?/q? times a polynomial in In (— q’/m?), and therefore its asymptotic limit 
Ags vanishes perturbatively. 

We consider now a variation of m (hence also of x) for fixed a and A. 
According to (13-23), 

é rs] 
ae Olmaca => Tis In ZA len 

If this has a limit as A > oo with « and m fixed, the only dimensionless parameter 


left on which it can depend is «. From dimensional arguments we can therefore 
define 


2 


: 6 7) A 
B(a) = Aim am a Zalasn = iim aA a In 23 (*. aa) (13-25) 


We shall soon verify that B(«) exists perturbatively. For this purpose we compute 
the corresponding derivative of d~*: 


Ones é 6], q 
pn POR daa = E om ue a (. om 
From (13-24) we can also express the left-hand side as 


0 m Omo 
lin m ped eta a= fn 


7) 
x Mo emo d~ (Xo, ie ms, A?) | a, A 
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If B(«) is finite the same should be true for 


mM Omo 


1 + d(a) = iim Ziy 
Om 


(13-26) 


ao,A 


leading to the Callan-Symanzik equation 


2 
[m2 ave A (2 -£)- [1 + 500] (2, -5) (13-27) 
ny m 


From the fact that A,, vanishes it also follows that 


E = + B(a) Z| dz,'(a, x) =0 (13-28) 


Equation (13-28) implies that {(«) is finite perturbatively. It is sufficient to 
use the equation for some definite value of x. Conversely, we may derive re- 
normalizability from a study of this equation and its extension to other Green’s 
functions. 

From the three-loop computation of de Rafaél and Rosner we obtain for 
fermionic quantum electrodynamics 

2a? hd 121 a ; 
Oe of, 14a | OO) ee 
In contradistinction to the function wy, the function f depends on the convention 
used to define the renormalized coupling «, except for its first two terms. The 
above expression shows that for « + +0, f(a) is positive. 

As promised before, let us now derive the leading behavior of the vacuum 
polarization to an arbitrary large order as a consequence of (13-28) and (13-29). 
We substitute in (13-28) the formal series 


de sia 1 + . apa) | 


B(x) = >) baat"** (13-30) 
1 
2 1 
where bi = = a 
and obtain the following equation: 
d n 
2 Fix Pat) = La Vp(a)bn-aer 2 2 (13-31) 


The solution is 


b 
pi(x) = —zinx+C 


[is nL 
pal) = — @ oe + Otnxt) 22 


(13-32) 
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This shows that computations up to the two-loop order yield b; and b2, which 
in turn determine the leading behavior to order n. 

We may reorganize the expansion of dj,! by summing successively leading 
logarithms, subleading ones, .. .: 


= a one & 3 ae a Gr 13263 
dz,'(a, x) = a ( Zinx) +p in(1 2 nx) 1 ( ) 


Unfortunately this method does not enable us to reach d,,"(%, x) for large x. 
For instance, if we were to keep leading logarithms we would recover the un- 
physical Landau ghost by setting 1 — («/3m) In x equal to zero. For this value 
the second term is logarithmically infinite, violating the assumption that it is 
subdominant. Stated differently, the reorganization implied by (13-33) is only 
useful when | « In x| «1. To find the true behavior for large x requires to sum 
up all logarithms or, equivalently, to return to the original equations (13-27) and 
(13-28). 


It is interesting to understand diagrammatically why it is sufficient to know b; and b2 alone in 
order to obtain the coefficient of the leading logarithm to any order. We recall an observation made 
in Chap. 8, according to which when summing gauge-invariant classes of diagrams the leading power 
of In x is equal to the number of internal fermion loops. The reader will find it instructive to derive 
this property by the device of introducing N-degenerate species of fermions and studying the depen- 
dence on N. Diagrams with the largest number of fermion loops at a given order in x are of the 
type shown in Fig. 13-2. The resulting structure shows why a two-loop calculation yields the coefficient 
of the dominant term to order n as proportional to b]~*b3(In x)"~'. 
From (13-25) it follows that the constant Z;3 satisfies the equation 


[aay + 9 (28-1) eronaync 
(13-34) 


Therefore it is possible to study the limit of infinite cutoff » > oo, provided we are willing to make 
some hypothesis on the function f(a). 


The Gell-Mann and Low function w and the Callan-Symanzik coefficient 
B(a) are, of course, related. Take, for instance, the derivative of the expression 


& € © + crossed + 


Figure 13-2 Diagrams with the maximal number (n — 1) of fermion loops contributing to vacuum 
polarization at order n. 
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(13-11) with respect to x», and substitute in (13-28). We obtain 


1 0 
W[das(x, x)] = 7 Bla) = das, x) (13-35) 
which may be specialized to some value of x. A possible choice is x = 1(q? = —m?), 
with d*(«, 1) = a (a) = « — 5u?/9n +--+, and 
1 doy, 
W(a1) = 5 B(x) og (13-36) 


This also shows that the zeros of the two functions are related. If «, is such that 
f(a.) = 0, then WY vanishes for «,, = «;(«,). In particular, when this value is an 
ultraviolet fixed point, d,, tends to «, as x goes to infinity. Furthermore, from 
(13-34) Z; is finite and x,, plays the role of a finite bare square charge. Adler 
has given a thorough discussion of the possibility that «, might be equal to the 
observed physical fine structure constant. Unfortunately, at present we lack a 
definite nonperturbative procedure to evaluate these functions, so that the whole 
matter remains speculative. 


Derive for the other Green functions of electrodynamics the analogous Callan-Symanzik equations. 


13-2 BROKEN SCALE INVARIANCE 


As realized some time after the early investigations of Gell-Mann and Low, a 
more general point of view is to investigate the short-distance behavior of Green 
functions in a renormalizable theory when all relative distances are space-like 
and tend to zero simultaneously. This question may seem purely theoretical 
since it involves amplitudes far off-shell. Luckily this is a wrong impression. 
Indirect means, such as deep inelastic lepton scattering on hadronic targets, enable 
one to probe short-distance interactions. The results of such experiments, antici- 
pated by theoretical considerations of Bjorken and Feynman, partly motivated 
these investigations by Wilson, Symanzik, and Callan. 

The expectation that at large momenta when masses become negligible the 
theory becomes scale invariant is in fact too naive. The asymptotic behavior is 
given by the corresponding massless theory. Renormalization imposes the choice 
of an arbitrary energy scale as discussed in Chap. 8. This scale spoils dimensional 
analysis. Its very arbitrariness is in fact what saves the day. A change of scale 
may be absorbed into a modification of the coupling constants. The corresponding 
flow is governed by functions similar to the f coefficient encountered above, 
and the renormalization group transformations appear therefore as a substitute 
to naive dimensional analysis. 

Ultraviolet fixed points in this flow (if they exist) will attract the coupling 
constants as A increases to infinity. This leads to the restoration of short-distance 
scale invariance for specific values of the couplings largely independent of the 
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initial data. In particular, the observed dimensions of the fields (or other com- 
posite operators) will in general depend on the dynamics. 

Of special interest is the case where the origin is an ultraviolet fixed point. 
This situation is called asymptotic freedom. Logarithmic corrections to naive 
scaling will then emerge as a result of renormalization. 


13-2-1 Scale and Conformal Invariance 


If in a classical action all dimensional constants vanish we would expect the 
theory to be scale invariant. This could also be the case in a massive theory at 
short distance (typically m|x| < 1). 

If the configuration variables are scaled down 


. xx =1-*x A>0 (13-37) 
the fields, noted generically as ~, would transform according to 
(x) > *e(x) = U(A)e(Ax) (13-374) 


with U(A) standing for a finite dimensional representation of the abelian group 
of dilatations. We shall assume this representation to be fully reducible. This 
means that we may write 


U(A) = eDind 


where the matrix D can be diagonalized. In infinitesimal form, when In / = de, 
the transformation law reads 


é 
69 = -— . 
po = 0€ (x ax + Do (13-38) 


In a classical massless theory (13-37) or (13-38) lead to an invariance provided 
the eigenvalues of D are equal to 1(d/2 — 1) for Bose fields, and #(d/2 — $) for 
Fermi fields. The quantities in parentheses apply to the case of an arbitrary 
dimension d instead of four. 

We can also consider the effect of such transformations in a massive theory, 
obtaining therefore a Ward identity reflecting the breaking of scale invariance. 
In this sense we differ from pure dimensional analysis, since we consider the 
effect of a transformation of dynamical variables (the fields) but not of the dimen- 
sional parameters such as masses. If we were to do so we would relate two 
different physical situations. 

For our favorite example{ 


4 


Qe 


1 24 
L => (09) —™ @ er: 


(13-39) 


{To avoid confusion with the scale parameter A, the g* coupling constant will be denoted g 
throughout this chapter. 
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we find a variation 


— oS a o* m? 


Hence, if D = 1, 
bg 8 ~, 


The integral { d*x 4* Y(Ax) is independent of A (positive). From a differentiation at 


A = 1 it follows that 
d*x Say x 
xa x)= 


meaning that [x-(0/0x) + 4] (x) is a divergence and that the variation of the 
action I reads 


oI = be [a xm2o?(x) (13-41) 
Obviously, when m vanishes we have classical scale invariance. 


Let us show that conformal invariance is then a consequence of scale invariance. 

The conformal group is defined as the set of transformations leaving the angles invariant. This 
carries over to Minkowski space, where we deal with hyperbolic as well as circular angles. This group 
is obtained by adjoining to the Poincaré transformations an inversion with respect to an arbitrary 
point—the origin, for instance, 

u 
xt oe x! ae (13-42) 


x2 


For this definition to be meaningful, the usual R* space must be completed by a cone at infinity. 
Let us introduce a useful geometrical representation. We consider a six-dimensional space with a 
(2, 4) metric, ie., such that 


= (dzo)? — (dz,)? — (dz2)? — (dz3)? — (dz4)? + (dz- 1)? 


The lines belonging to the isotropic cone z? = 0 are identified with the usual R* space completed 
by a Gone at infinity. This may be realized, for instance, by cutting the cone z? = 0 by the hyperplane 
z_, = 1 and projecting stereographically on R* the resulting one-sheeted hyperboloid from the point 
(0, 0, 0,0, —1) in R®* (Fig. 13-3). The explicit formulas are 


Xn 1+ x? 

a ss Sega 
Yuy" — (ya)? = —1 O<ps3 (13-43) 

-—1 

Pee Le cee 

Lye Yat 1 


The action of the four-sheeted pseudoorthogonal group O(2, 4) is reflected on Minkowski space as 
conformal transformations. In particular dilatations correspond to hyperbolic rotations in the plane 


644 QUANTUM FIELD THEORY 


Figure 13-3 Projection of a (1,4) hyperboloid on Minkowski space. 


ily Ball 6 


Vu , _ ¥4 cosh 6 + sinh 6 


-0 , 
x =e xa = . b) = : 
Ye cosh 6 + y4 sinh @ y4 sinh @ + cosh 6 


2, = Zp, 24 =2Z4,c00Sh9+z_;sinh@, z_,=z_, cosh @+ z,4 sinh 0 (13-44) 


As an exercise, find the four other types of conformal transformations, completing therefore the 
number of generators to 15. Write the corresponding transformations in Minkowski space. Perform 
the similar construction in the case of an euclidean four-dimensional! space. The conformal group 
is then O(1, 5) and R*, completed by a point at infinity, may be identified with the stereographic 
projection of a unit sphere in five-dimensional space. 

In order to prove the conformal invariance of the massless @* theory it is therefore sufficient 
to study the effect of an inversion. In five-dimensional y space this transformation corresponds to a 
symmetry y <> —y of the unit hyperboloid. We have to choose a transformation law for the field. 
Equation (13-37a), *@(x) = A(x), suggests the definition 


g(x) > 9'(x) = = Q (3) (13-45) 
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It follows that 


j 14 4 rs) 
l= [atx E (a9'P —g =| =I+ [ax <2 9(x) ( +x° =o 


The additional term is a four-divergence 


4 0 a 
ale ee gp = 20, Saas) 


Formally (1.e., barring possible singularities) the action, and therefore the equations of motion, are 
conformally invariant. 


Exercises 

(a) Express the massless y* theory in five-dimensional space with dynamical variables defined on a 
unit hyperboloid (minkowskian case) or unit sphere (euclidean case). Write the corresponding 
lagrangian and equations of motion. Expand the solutions of the classical free-field equations 
in terms of generalized spherical harmonics. 

(b) Show that the variation of the action of a massive theory under a dilatation [Eq. (13-41)] may 
be written as the integral of the four-divergence of a dilatation current. The latter is related to a 
modified energy momentum tensor (such that its trace vanishes in the massless case) as follows: 


51 [ats 8,S"5e 


Sitmexy re ~ (13-46) 
T* = 0" 0") — gL + 3(9"D — d*0")9" 


For a discussion see the work of Callan, Coleman, and Jackiw. 
(c) Investigate scale and conformal invariance in the presence of Fermi fields. 


13-2-2 Modified Ward Identities 


We perform a Wick rotation and study the euclidean theory. For our problem 
of short-distance behavior this implies no restriction. We write the normalized 
generating functional as 


et) = [20 exp \-1 + [atxio} (13-47) 
The euclidean action J may be split into 


raga te 2 ge" 4m 


A scale transformation g(x) > *@(x) may be considered here as a simple change 
of integration variable, under which 


esd. = (bee daee [atx jooa [ats i)e(Ax) 


The change in the measure is absorbed into the normalization and we recover 
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the naive Ward identity in the form 


2 0 
[as in les | + j(x) ( 4x" ) sat WD x ( jp) (13-49) 


The term in (6/6j)? could be replaced by a derivative with respect to a source 
for the @* operator. Of course, we expect Eq. (13-49) to be modified because 
of renormalization effects. In most of the cases encountered previously, suchi as 
gauge invariance or global symmetries, we were able to display a regularization 
and renormalization scheme preserving the symmetry and hence the Ward 
identities. But chiral anomalies have been the warning that modifications may 
appear in the quantum case. 

Before displaying the aforementioned modifications let us disentangle the 
algebraic structure of (13-49) rewritten as 


Ce 0 \ 6G 6G i ae) a 
[atx(iea(1+ x3 set" | ee] + 2S (j=0)=0 (13-50) 


The Legendre transformation to irreducible functions reads in euclidean variables, 
Eq. (6-97): 


S kad) 


G(j) —- T(¢) = | d*x j(x)p(x) 


6G(j) 
j(x) 
or(@) 
dQ(x) 


4, OZ) Sply) _ ., 
with {a aCiiey = 0" (x — y) 


implying that the kernels —65°G(j)/dj(x)dj(y) and 5?T(o)/de(x ‘)d@(y) are the 
inverse of each other. To simplify the notations we denote them 

d°T (9) te 5°G(j 
ea . Gmeoe me IE 
d(x)dp(y) 0j(x)dj(y) 


P(x) = (13-51) 


id) = - 


I(x, y3 9) = (13-52) 


in such a way that Eq. (13-50) takes the form 


Jeo 2] 


+ m’[—@?(x) + T71(x, x;o)-T7!(x, 0) (13-53) 
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Expand (13-53) in powers of @ to obtain the corresponding identity pertaining to the n-point 
irreducible function. 


To zeroth order in h, (13-53) reduces to the trivial cases encountered in the classical discussion 
(up to the Wick rotation): 


1 m o* 
1} _ |} ga] * ohn, WERE ae 
[tle [4 «|5 (dg)* + Sia +9 =| (13-54) 
é 67 a 2 
| d*x ( ap ee =) | io) =— | dix: (: + «Z)oo| (- Oe+nrot+g i) 
= | d*x m*?(x) (13-55) 


Equation (15-53) is indeed satisfied since [T~ (x, x; @) — [7 4(x, x;0)] is of order h, as is readily seen 
by restoring the factors h (T > T/h, 6/dp > h 6/d@). To order one this quantity may be written 


: 1 1 
Seti d! Abe eS SO 
—-O+m’+ 997/22 -O+m? 


1] 1 5 oy? 1 
ie ite In}{-Ol+m icy eee (13-56) 


and, formally, 


Since 


‘ ae: | re 
4 “| (1 +x =) ot) | =A aA To) |a=1 


it follows that 


[a ; é - or) a ri an 1 1 ) 
RK + x+ - Jg(x) | —-= -—m — i 
; GX ra dex) ton -~O+4+m+g99?/2 -O+m 


(13-57) 


and Eq. (13-53) would be verified, if it were not for the necessary ultraviolet subtractions. 


. 


The functional 
T4(9) = es He —@ 0) L '(cuecs pe) — > (oes; 0)] (13-58) 


is interpreted as generating the irreducible Green functions with an insertion of 
the operator —(m?/2) J d*x (x). This was clear at the level of Eq. (13-49), 
where the term m? J d*x [6/6j(x)]* arose from the path integral over m? f d*x o?(x). 

We check it at order one in Eq. (13-57). The corresponding diagrammatic expan- 
sion appears in Fig. 13-4. We call it a mass insertion. 

Power counting indicates that this insertion introduces logarithmic singu- 
larities in the two-point function at order h!. In operator language the relevant 
Green function is J d*+x <0] Ty?(x)g(y)¢e(z)|0>. Let us specify our normalization 
conditions. Until we are faced with difficulties, it will be convenient to use inter- 
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Figure 13-4 Mass insertion to the one-loop order. 


mediate renormalization at zero momentum: 


(2) 2 
T2(0) = —m? zi (P ) =-1 FRO)=-g (13-59) 
dp p?=0 
where m is proportional but not equal to the physical mass. 

Using a regularization with a large cutoff A and adding the required counter- 
terms to the lagrangian, we obviously modify the Ward identities of scale trans- 
formations. The relation between bare and renormalized irreducible Green’s 
functions is 


T'r(¢, m, g) = 1(Z*/? —, mo, go) (13-60) 


where on the right-hand side the cutoff A appears in the wave-function renor- 
malization Z, the bare mass Mo, and bare coupling go, but the combination 
Tz has a finite limit where A > co with m and g held fixed. Ordinary dimensional 
analysis implies that the dependence on A of the quantities Z, go, mm” is only 
through the combination A?/m?. 

The bare generating functional of mass insertions is 


é 
Ts,o(Y, Mo; Jo) = Mo a T3(9, mo, Jo) | go, (13-61) 


We are now in a position to study scale anomalies. A variation of mo at 
fixed go and A implies variations of the renormalized parameters g and m 
satisfying 


a 
jae — dm += dg (13-62) 


From (13-60) and (13-61), 


d 0 0 
22 1,(Z"2@, mo, go) = G ——+dg ae ; d(in Z) 


No om 


6 
‘ [as (x) 5a [Pate m,g) (13-63) 
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while ordinary dimensional analysis implies 


@ P @ 
"ant | d at +x" =) 09 | 5! Bae sarap PHO. 0) =0 a= (13-64) 


This enables us to rewrite (13-63) as 


do {A 0 logue A 
oa(. aw) 2 — [ar +3 m £-1nZ (2, oo] 


6) é 
ap oat | (x) saat TR(Q, m, 9) 
= Ee Zrmothim 9) Tr Zt, maa (13-65) 
Mo om 


Let us anticipate the fact that the dimensionless coefficients on the left-hand side 
have a finite limit as A > co with g and m fixed. Again from dimensional argu- 
ments they can only depend on g in this limit. We set 


0 A 0 is 
Big)=m = a(*. 20 =—-A Aa(* 7 (13-66) 
1 0 A 1 4) A 
V(g) = sm nz (2, to) = -3n4 m2(4, oo} (13-67) 


As the notation indicates the derivatives are taken at fixed go. Of course, A — 00 
is implied, which means that in perturbative calculations all terms vanishing for 
infinite cutoff are to be neglected. 

According to the analysis of Chap. 8, the mass insertion is multiplicatively 
renormalizable. There exists a constant Z,y2 such that 


Zo:V 4.4(Z"/* Q, mo, Jo) = Ts,x(Q, m, 9) (13-68) 
with I’, r(@, m, g) finite. We define 


7 Lue dmo(A/m, go) 


13-69 
a ae (13-69) 


1 + 6(g) = 


assuming the limit to exist, for a well-defined normalization prescription of the 
insertion. - e 

Dropping the suffix R we obtain the Callan-Symanzik equation in the final 
form 


B(g) 55 — [a xfft (Gg) ee 2 lo or Bae) = 21+ 6@)]Ts (13-70) 
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Comparing this expression to the incorrect formula (13-53) which neglected 
renormalization, we see that it differs by the appearance of terms involving the 
coefficients f(g), y(g), and d(g). Before proving their finiteness, we comment on 
their interpretation. 

First of all, they are similar to the anomalies of chiral invariance, modifying 
the classical Ward identity. We may look at 7(g) as playing the role of a coupling 
dependent addition to the field dimension. The term in {(g)0/eg is a remnant 
of the presence of the dimensional cutoff A in the relation between g and go. 
It implies that an infinitesimal scale transformation has to be accompanied by 
a small change in coupling constant. Finally, 6(g) could be absorbed a posteriori 
in a finite renormalization of the mass insertion. 

To prove that these coefficients are finite we expand (13-70) in powers of 
@ (even powers for the o* theory). In momentum space we obtain 


n 


"YF pe LTD) + BQ) Lp) + (4 — nf + eI PP) 
T ODx og 


= 2[1 + dg)JTX(0; p) = (13-71) 
For definiteness we complete the normalization conditions (13-59) by 
Tr (0; 0) = —m? (13-72) 


which is verified to order #°, and is sufficient to ensure the finiteness of Ij. 
Applying Eq. (13-71) in the vicinity of p = 0 for n = 2 we get two relations: 


7(g) + d(g) = 0 
aF?'0; p?) (13-73) 


Op? p-=0 


y(g) — [1 + 6()] 


which prove that y(g) and o(g) are finite. From (13-71) it then follows that f(g) 
is also well defined. We emphasize that in general these coefficients depend on 
the normalization conditions. In practical calculations it will sometimes be advan- 
tageous to use (13-66), (13-67), and (13-69), which relate them to the divergences 
of perturbation theory. This shows in fact that to lowest order (f'), B and y do 
not depend on any convention. 


The above presentation relies heavily on the bare theory with its infinite cutoff A. A scrupulous 
reader might suspect that it is possible to avoid such a detour, and derive Eq. (13-70) within the 
finite renormalized theory. The price of such an approach is the lack of intuitive appeal. On the 
other hand, the Callan-Symanzik equations may serve as a basis for the construction of the renor- 
malized theory. 

We have emphasized the interpretation corresponding to the modifications of broken scale in- 
variance Ward identities. This is in fact the useful aspect of the Callan-Symanzik equations. They 
may, however, also be interpreted as renormalization group equations. This is achieved, for instance, 
in the massless theory by shifting the dilatation factor from the momenta to the arbitrary subtraction 
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point yz with the result that Eq. (13-71) reads 


F) a 
E aut Big) pat mt) | P(p, g, w) = 0 


It is also clear that the derivation generalizes to other renormalizable theories. The B(g) function 
becomes a vector field in the case of several dimensionless coupling constants. In gauge theories, 
abelian or not, the Green functions depend in general on the gauge parameter A, and a new term 
C(g, 4)0/0A shows up on the left-hand side of Eq. (13-71), where 


. é A 
$(g,A) = m —2|—-, 90,40 
Cm \m 


As Ao is related to the wave-function renormalization Z, of the gauge field through 


A= Z3A0 


(13-74) 


Asgeealy 


we have 


Cg, 4) = 2Ay(g, A) 


in terms of the anomalous dimension y = 4m(é/ém) In Z3. In nonabelian theories, the latter, as well 
as anomalous dimensions of other matter fields, depends on J. This is not the case for f(g), at least 
for certain normalization conditions, as noticed in Chap. 12. 

Equation (13-71) may be compared to the analogous result (13-28) in electrodynamics. Note 
that there exists a relation between f and the anomalous dimension of the electrodynamic field y. 
This is a consequence of the Ward identity: e? = eZ3. 

The previous analysis also generalizes to functions involving composite operators. Each set of 
operators with the same dimension is multiplicatively renormalized by a renormalization matrix Z 
[see (8-69)]. The proper functions (of a massless theory, for simplicity) satisfy 


6 fa] 
u— + BG) —— ny(Q) TY + vus(g) TY = 0 
Op 0g i i 


é 
with vig) = —B ap Zin | ao(Z~ "ei (13-75) 


The rationale for the discrepancy of sign between (13-67) and (13-75) will appear soon. 


13-2-3 Callan-Symanzik Coefficients to Lowest Orders 


For the y* theory we could make use of the results given up to order f* in 
Chap. 9. It is perhaps more instructive to derive 8 and y from the counterterms 
of the lagrangian. To be specific we assume here a gaussian regularization 


1 oo ee) or eo (etm yA? 
—— =| dew, da er +m) = —____ 
pam 0 1/A2 ptm 


and compute I''?)(0), (d/dp?)'(0), and (0) as functions of go, mo, and A. 
We will not omit the tadpole terms which must cancel at the end of the calculation. 

For our purpose it is sufficient to compute (0) up to terms in h; we 
recall that in y* theory there is no wave-function renormalization to this order. 
We display the Feynman diagrams to the right of the corresponding expressions. 
As in previous chapters » denotes Euler’s constant. Terms written as constant 
are meant to be independent of A. 
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We find 


(2/9) — 
ry(0) 1 go | m2 In a — mo(1 — »| 
0 


(13-76) 


we Oo 
= ‘oO Gn) 3 (in ae + constant SZ 


#4 2 
3 9 ie 
m 


0 


3 Ne 
Bae Jo E (in 4 x ay =m at +In2)+ constant | 


3 gf hee ie 
-} | (mS — 2%y+1+1n2)In ae 


TO) = 


+o 414027 | 


ogo Ne ie 
~ 4 (an? (* : ae 
According to (13-59) we have 


(2) 
120) = —m?Z7! = 


yO=-Z 5 Ty Oy= —25%4g 


Therefore Eq. (13-76) yields Z, g, and m as functions of mo, go, and A. We sub- 
stitute m? for m3, keep only terms up to order g?, and use the notations 


— eos — _90_ = 
(ny °° np wal 
It follows that 
A2 
Z '=1+— I (in — + constant) + Bie 
Pe OE: A? 
C= hoe 1—; do In7z—y—1—In2 (13-78) 


gi eee te (2 y ee 
a6 A Dies 2 es + constant | + 
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Terms proportional to A? have fortunately disappeared, leaving only logarithms. 
Applying the definition (13-66), 


B(g) ba Cx - CuliinZ, oa 3a 
2(4n)? In A? |g I A? a 


we NM 9 3 9 
ote ee ae 


The factor Z* may be replaced by unity to this order. Reexpressing a in terms 
of « we obtain 


NO 


zd 3 
ae 
B(g) = 3 ne 3 ane + (13-79a) 
Similarly, 
= 0 =i g? 

(g) = Aine In Z eee = 12(4n)* ar (13-79b) 

g* 
6(g) = —y(g) = — D Gn Sa (13-79c) 


All coefficients are finite, as expected. Moreover, the Euler constant y and In 2 
which appeared at intermediate stages as a result of our regularization scheme 
have disappeared. 


Exercises 


(a) Check that f(g) is independent of the conventions to order h?. 

(b) Study the modifications arising from an internal O(n) symmetry group. 

(c) Derive from (13-70) the Callan-Symanzik equations satisfied by the functions Ve,-(@) and Zer(@) 
of Chap. 9 [Eq. (9-116)] and verify that they are obeyed perturbatively by the expressions 
obtained in (9-132). 


We can also use the results of Chap. 12 on gauge fields to get the correspond- 
ing f function. To lowest order the relation between renormalized and bare 
couplings was found to be 


2 
go {11 4 A A 
= 1 —C--= — 13-80 

g tol +f (5 Cc 51; )In ll + O(g6) ( ) 
The energy scale u was an arbitrary device to secure a definition of g in the 
absence of a Higgs phenomenon inducing a genuine physical mass. From (13-80) 
we obtain f(g) to order g*. Let us also add the two-loop contribution computed 
by Caswell, Jones, Belavin, and Migdal: 


Se a eee g {34 20 
B(g) = -fa($e-$n)+4 (- Cel CT + 4C,T;) + 0107 


(13-81) 
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The interesting feature of this result is that, contrary to similar expressions for 
quantum electrodynamics [Eq. (13-29)] or y* theory [Eq. (13-79)], B(g) has a sign 
opposite to g as g 0 (provided T; < 4¢C). We shall later see the importance 
of this remark. The computation of anomalous dimensions is a priori of interest 
only for gauge invariant operators. We will also have to return to this point. 


Exercises 


(a) Show that the computation performed, in quantum electrodynamics agrees with (13-81) when 
we set C = 0, T; = Cy = 1, and write B(a)0/d« = f'(e)d/de. 

(b) Discuss the case of gauge fields coupled to scalar bosons. Obtain their contribution to lowest 
order and verify that it is one-eighth of the fermion one, provided they correspond to a real 
representation. 


13-3 SCALE INVARIANCE RECOVERED 


13-3-1 Coupling Constant Flow 


The exact equation of Callan and Symanzik finds its most interesting applications 
when the right-hand side involving the mass insertion may be neglected. If it 
were not the case we would have to face a cascade of functionals T, Ty, Tay,... 
containing the full complexity of the amplitudes throughout their kinematical 
domains. Here we are only interested in the deep euclidean regime where all 
momenta become large. This is, of course, only meaningful with respect to some 
mass scale. The Weinberg theorem of Chap. 8 comes to the rescue here. Its 
application to a strictly renormalizable theory shows that perturbatively PY” in 
(13-71) is depressed by a power p* (up to logarithms) with respect to the left- 
hand side. Therefore the massless limit of the theory exists provided normalization 
conditions at nonzero momentum be chosen. This offers the choice of an energy 
scale. Henceforth I'{? will denote the corresponding massless Green functions. 
They satisfy homogeneous equations: 


pias é F 
\- ps Pea 4. BO) 5, joo — 1 le oer g)=0 — (13-82) 


Similar relations may be derived for I” and the coherence of our assumption 
checked if the solution is negligible with respect to P”. 

The solution of Eqs. (13-82) will exhibit a structure reminiscent of the relation 
between bare and renormalized theory. The reason lies clearly in the way the 
equations were obtained. The difference will be that infinities are no longer 
involved. A finite renormalization effect will accompany a change of momentum 
scale. To describe it let g(A) be the solution of the differential equation 


d 
AT 9A= BLA] gl) =g (13-83) 


This generalizes to a first-order differential system in the case of several coupling 
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constants. Let us also introduce 


a di! g(A) dg’ 
Al —. ——. y — if — - 
2(A) = exp {| 7 ifaw} exp | BQ) 101 | 2(1)= 1 (13-84) 
Equation (13-82) takes the form 


Ch faye 
Aa yet nta) ATPL *Pii aot =) (13-85) 
meaning that 
Tp; g) = 44~-"2(a)-"T 2 Lp: 9(A)] (13-86) 


The departure from the naive scaling factor 1*~” arises as an anomalous dimension 
[z(A)"] and a change in the coupling constant g(A). 

It remains to find what happens in the limit A > oo. In particular, we have to 
study the limiting behavior of g(A). From Eq. (13-83) it is clear that the crucial 
issue 1s the location of the zeros of B(g). It may occur exceptionally that the initial 
coupling g(1) just coincides with such a zero, call it g,.. In this case g(A) = gq 
independently of 2. 

In general, g(A) will vary as 4 oo; it grows if B is positive or decreases 
if B < 0. This variation is only interrupted when it meets a zero of f. A possible 
situation is an evasion of g at infinity when 4 grows. This would happen if 8 
is of the same sign as g for all g, and vanishes only at the origin. Such a strong 
coupling situation is hard to analyze. 

In order that g(A) tends to a finite limit g,, as 4-00 it is necessary that 
B(g.) vanishes and that (g — g..)B(g) be negative in the neighborhood of gw. 
This is called an ultraviolet fixed point (Fig. 13-5). It could happen that a fixed 
point is ultraviolet attractive on one side and ultraviolet repulsive on the other. 
Such is the case if two simple zeros coalesce. For a simple attractive zero In A 
will diverge as 


ni | 
Iniw~ | g 
Jo — 9 
and g(A) will tend to g,, as an inverse power of 4. A similar analysis can be 
carried out for a multiple zero. On the other hand, nothing spectacular is expected 


Figure 13-5 Ultraviolet fixed point. 
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from »(g) in the vicinity of g.- If yo = Y(Go), then up to a 4-independent factor 
~ Veo 
Ina general situation, attractive and repulsive fixed points appear successively 
along the g axis. According to the initial value g(1), the effective coupling constant 
g(A) will tend to the nearest ultraviolet attractive fixed point as A> OO. 


Ultraviolet unstable points, (g — g..)B(g) > 0, are infrared attractors and correspond to limits of g(A) 
when 1-0. This is of interest when we study, for a massless theory, the limit of large distances 
or very small momenta with respect to an ultraviolet cutoff. This is precisely the aim of the theory 
of critical phenomena where m = 0 corresponds to the critical temperature and we investigate the 
long-range tail of correlations. 


Returning to our original problem we conclude from the existence of a nearby 
ultraviolet fixed point that for large 1 the asymptotic behavior of amplitudes is of 
the form 


D(Api; 9) Vas’ (Apis g) ~ ASO Ta (Dis Geo) (13-87) 


Scale invariance has been recovered, corresponding to a specific value g., of the 
coupling and with an effective dimension for the field 


deg) = 1+ Yo (13-88) 


The Green functions with a mass insertion A obey 


a a 
(- Soe on ct BQ) ag 2 —n{i + y(9)] + ral) \T 05 g)=0 


with y,(g) = @ln Z,:/0 InA [see Eq. (13-75)]. As long as y4(y.) < 2. the perturbative hypothesis of 
identifying the asymptotic regime with the massless theory is consistent. Otherwise the mass insertion 
would, in effect, correspond to a hard operator and the theory should be reformulated. See Sec. 
13-3-3 for additional remarks on this point. 


This beautiful reasoning, essentially due to Wilson, shows how the renor- 
malization group has been an instrument in recovering a nontrivial scaling 
behavior. The crux is now to find whether f(g) has such zeros. Since, however, 
the origin is always such a zero, as f(g) vanishes in the absence of interactions, 
it is of special interest to assert perturbatively the nature of this fixed point. 


Exercises 


(a) Discuss the effect of a multiple zero on the asymptotic scaling law and find the corrections to the 
dominant behavior. 

(b) Show that the positive measure in the Lehman-K4llen representation for the two-point function 
implies y > 0 and that y,, = 0 implies a free theory as noted by Parisi and Callan and Gross. 


ASYMPTOTIC BEHAVIOR 657 
13-3-2. Asymptotic Freedom 


We have given examples of the computation of {(g) for small g in the case of a 
self-coupled scalar field and abelian or nonabelian gauge fields. The results are 
represented in Fig. 13-6. For electrodynamics we have plotted the quantity 
4xB(a)/2e [with B(x) given by Eq. (13-29)] as a function of e. 

If the origin is an ultraviolet fixed point we say that the theory is asympto- 
tically free. Among the examples exhibited here, only nonabelian gauge fields with 
a small number of fermions possess this property. Superficially ~* theory would 
also be in this case for g < 0. But it ts likely to be unstable for negative g. Since 
asymptotic freedom would mean that at least for the large momentum regime 
radiative corrections are calculable perturbatively, we would then’ find an un- 
bounded effective potential. We conclude with Coleman that for g < 0 the theory 
is unstable. 

An exhaustive examination by Coleman and Gross establishes the following 
result. No renormalizable field theory is asymptotically free in four dimensions, 
except if it contains nonabelian gauge fields. 


We outline the derivation of this important result. 


(a) Scalar theory 
The ¢* interaction is generalized to an arbitrary set of interacting scalar fields in the form 


Lint = —GijerPiP j PrP 


with gj jx: totally symmetric. To order 4, 
d . 
Bij = A ZA Gije(A) = A ay [Gitmn(A) 9 jemn(A) + permutations | 


with 4 > 0. Assuming the quartic form gj jx:@i9j Px Q positive for stability, we derive that Bj; > 0. 


(a) (b) (c) 


Figure 13-6 The function B(g) near the origin: (a) self-coupled scalar field; (b) nonabelian gauge 
field with 11C > 4T;; (c) nonabelian gauge field with 11C < 4T;, electrodynamics C=O, ips iL 
or Yukawa coupling. 
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(b) Yukawa interaction 
The interacting lagrangian is of the form 


Ly = — Gir PiPj PGi + uP (Aa + iys Boo” 
The positivity of B;;;; may be spoiled, but all Yukawa couplings cannot vanish asymptotically. Indeed, if 
gt = A* + iys BY 
then ‘ 
A= tr [g(A)g"(A)] = 0 


Since abelian gauge fields are not asymptotically free either, the only possibility that remains is to 
include nonabelian gauge fields. 

If Yang-Mills fields are coupled to fermions it is necessary for a simple gauge group that the 
condition 11C > 4T; be fulfilled. If boson fields are also present the situation is more involved. 
First these fields must not be too numerous to keep f(y) < 0, where y is the gauge coupling. Further- 
more, these fields have their own self-coupling. Assuming, for instance, that there exists a unique 
scalar field belonging to the adjoint representation, its self-coupling g, will obey to lowest order an 
equation of the type 


d A Z 5 
A 7) Gold) = Age(A) + Bige(A)g?(A) + Cg*(A) 
with A > 0. The quantity « = g,/g? will then behave according to 
d 
A = a(A) = Aa?(A) + Ba(a) + C 


Furthermore, we have to assume that g, is at least of order y*; otherwise the positive term Az? 
would be dominant and ruin asymptotic freedom. The right-hand side admits two roots x, and a 
of order unity provided B? > 4AC. This requires a delicate balance between the number of fields, 
as added fermions tend to increase the quantity B. If this is satisfied, then g,(4) will behave as 
ag?() when 2 — oo, with « one of the roots of the equation, preserving therefore the required property. 
The general situation with several Bose fields is even more intricate and requires a case-by-case 
detailed study. 

The introduction of such Bose fields may be motivated by the desire of generating mass terms 
through the mechanism of spontaneous symmetry breaking. In practice it is difficult to combine 
asymptotic freedom and symmetry breaking. By the introduction of a Yukawa interaction g-, we 
may at best achieve an unstable situation along an attractive line in the plane (g, Iau) For a 
discussion of this point see the lectures of Gross quoted in the notes. 


The conclusions derived from the calculations of the first few terms near 
g = Orely, of course, on the assumption that the perturbative series is asymptotic. 
When discussing large orders (Chap. 9) we have seen that this is the best that can 
be hoped for. The complete series is likely to be divergent with an essential 
singularity at the origin. Needless to say, to find the behavior of B(g) away from 
the origin remains a challenge for the future. 

The large momentum behavior of an asymptotically free theory becomes 
calculable provided the origin is the nearest fixed point. This is one reason why 
it is so favored by theorists. The name asymptotic freedom is, however, slightly 
misleading, since even then the scaling behavior departs from the free-field case 
by logarithmic factors. With 


B(g) = —bg* + O95) b>0 (13-89) 
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we obtain, for g(A), 


PU) =5 —_ +0 ea (13-90) 
For most of the interesting operators the function y(g) will be of order ae 
7(g) = cg? + O(g*) (13-91) 
Therefore the scaling factor will contain logarithms 
#4) © dg’ 
z(A) ~ exp E \ b “| ~ (2bg? In 4)*/?° (13-92) 


A Green function involving the operator O; will then behave as (2bg? In A)*/2, 
apart from the canonical power of A. 

The existence of asymptotically free theories has far-reaching consequences 
for the purpose of building models of strong interactions. They enable us to 
reconcile two seemingly contradictory aspects. At low and medium energies the 
interactions are indeed strong and complex with numerous resonances. An ap- 
proximate flavor symmetry according to SU(3) or SU(4) gives a qualitative 
description of hadrons as composite bound states of quarks. Every attempt at 
isolating these constituents has failed so far. At higher energies, however, quarks 
have much weaker interactions up to the point where they seem to act as free 
particles. The relevant kinematical domain reached by the experimental constraints 
is the one of light-like separations. We shall see in Secs. 13-4 and 13-5 that it is, 
however, possible to extend the short-distance expansion discussed so far to this 
region. It would then be possible to explain this paradoxical behavior by assuming 
an underlying asymptotically free-field theory. The catastrophical infrared singu- 
larities would be responsible for permanent confinement of quarks. These attractive 
speculations require necessarily such a model to include nonabelian gauge fields 
coupled to an unobserved (and perhaps unobservable) color degree of freedom. 


The generalization to several couplings and to the corresponding multidimensional flow uncovers a 
great richness of phenomena such as stable fixed points, limiting cycles, ergodic behavior, etc. The 
difficulty of obtaining reliable information on this flow away from the origin somehow limits the 
interest of such a study. 


13-3-3 Mass Corrections 


We return to the question of the consistency of the massless asymptotic theory. 
Nonleading powers in the large-momentum behavior of perturbative amplitudes 
have been neglected. They reflect the presence of terms in the lagrangian with 
dimension less than four and corresponding couplings with a positive dimension 
in energy. Typically these are the mass terms. We have already men ou that we 
can define anomalous dimensions for composite operators O; such as 9” or ww, 
thanks to their multiplicative renormalizability. Assume to simplify that the 
corresponding renormalization matrix is diagonal and of the form Z;(A/m, go). 
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Define 


0 A 
LC a 13-93 

vig) A aA In z(* ’ wo) ( ) 
If g is an ultraviolet fixed point, the effective dimension of the operator O; 
will differ from the canonical one d; according to 


deh = di + yi(Guo) (13-94) 


Here we only deal with.the case d; < 4. In general, mass corrections will remain 
negligible as long as Wilson’s criterion is satisfied : 


defi < 4 (13-95) 


This will be automatically verified in an asymptotically free theory where y,(g..) 
vanishes. We recall, however, that logarithmic corrections still affect the canonical 
behavior. 

A closely related procedure has been proposed by Weinberg. It relies on the 
construction of counterterms independent of the renormalized mass up to trivial 
dimensional factors. In other words, we do not fix the latter explicitly. A mean 
to achieve this result is to use dimensional regularization and renormalization. 
Therefore we shall no longer have to neglect the insertion of the mass term 
in the renormalization group equation. In g* theory the solution to the cor- 
responding equation will have-the form 


L.Apisg,m?) = 17 122) "1. (pee), mae (13-96) 
where m2(2) = exp | | eae (0) 


Show that to lowest order the contribution of the one-loop diagram of Fig. 13-7 leads to 


g 
Ya=B3 


(4x)? 


As A>, the condition m’(A) + 0 is y,(g..) < 2, and is therefore equivalent 
to Wilson’s criterion (13-95). Only in this case is a massless scale-invariant 
theory meaningful, even though this condition does not appear perturbatively. 
We should not conclude from the previous analysis that a soft mass insertion 
can be treated perturbatively. Indeed, derivatives of Green functions with respect 
to the mass of a sufficient high order are singular in the massless limit. This is 
due to the fact that after a certain point increasing the number of insertions 
does not improve the ultraviolet behavior (see the discussion of Weinberg’s 


Figure 13-7 Lowest-order mass insertion in the g* theory. 
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theorem in Chap. 8). An intuitive argument is the following. The second (third) 
derivative with respect to m of a bosonic (fermionic) propagator induces in 
general when m0 a logarithmic infrared singularity in a Feynman diagram. 


We therefore expect singularities of the type mi(In ms)*[mp(In mr)*] for small 
masses. 


13-4 DEEP INELASTIC LEPTON-HADRON SCATTERING AND 
ELECTRON-POSITRON ANNIHILATION INTO HADRONS 


We consider here in more detail a subject already introduced in Chap. 11. Our 
aim is to discuss in the next section the implications of a field theoretic model 
for the description of high transfer phenomena induced by leptons. 


13-4-1 Electroproduction 


We begin with a discussion of the electromagnetic scattering of a charged lepton 
(electron or muon) off a nucleon. The process has been depicted in Fig. 11-8. 
The initial and final lepton momenta are / and I’ and at high energy we neglect 
the lepton mass. The initial nucleon of momentum p (mass m) breaks into a 
final state X which is not observed; hence the name inclusive process. Up to 
radiative corrections the electromagnetic interaction acts to lowest order. A 
photon carries the space-like momentum transfer g = | — I' from the lepton to the 
hadron vertex. 

In practice the measured quantities are the laboratory initial and final lepton 
energies E and E’ and the scattering angle 6. We shail not discuss here polariza- 
tion effects. The kinematical invariants are 


— gq’ = 4EE' sin? e 
2 
v=p-q=mE-E) (13-97) 


eer, 
M? = (p+ q)? =m? + 2m(E — E’) — 4EE' sin? 5 


The nucleon being the lightest state with baryonic number one, the stability 
condition M* > m? implies that the Bjorken variable 
2 


eet eee (13-98) 
2v 


satisfies the inequalities 


0<x<! (13-99) 


The upper limit corresponds to elastic scattering. Both notations x and w are 
found in the literature. 
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Let J, be the hadronic component of the electromagnetic current. The scat- 
tering amplitude reads 


es 
Spi = i(2ny*4(py + U = p — Dall)y*u() ra <px|J,(0)|p> (13-100) 


For unpolarized leptons and nucleons the inclusive cross section is therefore 


1 gat 454 [ole ave ee yt 
Pepe CD ort! p= 0(S x) g(r 37"5 


5 D <pldu(0) Px? <Px|J(0)|P> 
pol 


The notation implies a sum over nucleon polarizations. Explicit computation 
yields 


Ley ater rr — gn) 
8 2 
Ww = 7 2 (22)*54(px — p— 4) ¥. <p|J,(0)|px> <px|J(0)| p> 
pol 


> as e'-* (p|J,(x),(0)| p> 


In the kinematical region of interest, W,,, may also be expressed as the Fourier 
transform of the current commutator 


ee = 3 [a e'4°* pl J,(x), Jo(0)] |p> (13-101) 


pol 


Relativistic invariance, current conservation, and parity in the electromagnetic 
case enable us to express W,, in terms of two structure functions W, and W, 
which generalize the elastic form factors [compare with Eq. (3-203)] 


Gudvy y 
Wy = Wir, q’) (9: ca tn) + W2(v, q = “(o, @ w»)( y= a a) 


(13-102) 
The cross section is expressed as 


Be ea (o059 ree 
dQ’ dE’ 4E? sin* (6/2) 2+2sin’ > W (13-103) 
Experimental conditions, sin? (0/2) « 1, make it difficult to extract W,. 


The amplitude W,, is related to the scattering of polarized virtual photons off a nucleon. Call Or 
and as the transverse and longitudinal cross sections (with os = 0 at q? = 0). Show that 
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Dye de 
W,(v, 2) = 6 
8n-~am 
(13-104) 
> 2v+q? 1 
W2(v, q?) Z 


~ Sram 1 — v?/m2q? (os + o7) 


where in o7 and as the flux factor is conventionally computed as if the photons were real with the 
same energy. 

Derive from these expressions positivity conditions for W, and W). For elastic scattering (@ = 1) 
derive the following relations with the electric and magnetic form factors, G_g(q”) and Gu(q’): 


W3'(v, 47) = — - Gix(q7) 5(2v + q?) 


(13-105) 
Wey, q?) = a Gi(q") — £ Gix(q") | O(2v + q’) 
. 1 — q?/4m? 4m? ™ q 
We recall from Chap. 11 that the scattering of neutrinos involves a third structure function W3. 
The cross section for neutrinos (v) or antineutrinos (7) reads [compare with Eq. (11-101)] 
do’ ~~ EE’m do” Ee 7) 0 E+E’ i) 
——— = (6? == 2- we) + 2 sin? — Wr? FE ——— sin? — wer? in 
dQ’ dE’ Te wages 2a (cos ois pees 2 a m al 2 Us Se 
or 
do G’E vW, y\ wW. 
dea "| aa 2. xy? erg oN aad 
dv dy - at f y) m ye #x(1 *) | te 
in terms of the variable oe 
ae y 
MY E mE 


The extra contribution represents the interference between the vector and axial parts of the current. 
In the high-energy limit where the mass of the final lepton (electron or muon) has been neglected, 
the nonconserved part of the current has disappeared. 


The experimental results show that beyond the domain of resonance excitation 
the cross section remains sizable in the very inelastic region, for —q* and v 
very large. For fixed q* the integral over v is comparable with the Mott cross 
section on a point nucleon. Accurate measurements at different angles allow us 
to separate W, and W, to obtain the ratio R= 6s/or with os and or defined 
according to (13-104). Its value is small, R ~ 0.15. The most striking phenomenon 
is, however, the scaling property anticipated by Bjorken and Feynman. In the 
very inelastic domain the dimensionless quantities 2mW, and vW,/m tend to non- 
trivial functions of the scaling variable w = 2v/—q?. This is shown on Fig. 13-8 
where vW,/m is plotted versus the modified variable w’ = (2v + m?)/—q? ~ w. 
Analogous results hold in the case of neutrino scattering. 

These phenomena suggest an interpretation of the scatterer in terms of point- 
like nearly noninteracting constituents, most of which have spin 4 (which cor- 
responds to a ratio R equal to zero). Such a phenomenological parton model, 
as it has been called by Bjorken and Feynman, suggests a number of sum rules 
in agreement with the data. An asymptotically free-field theoretic model gives a 
coherent framework where the constituents are the fundamental quanta: the 


quark and gauge gluons. 
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0.35 
0.30 
0.25 
= 
= 0.20 
a Figure 13-8 The function vW,/m 
ue plotted versus the modified scaling 
variable w’ =(2v+m7)/—q*. The 
0.10 value of R is of the order of 0.18. The 
data are from SLAC and are com- 
0.05 mented in the report of R. E. Taylor 
at the Palerme Conference of 1975. 
0.0 


The values of —q?/m? being large, 
resonances at small w’ have been 
w washed out. 


13-4-2 Light-Cone Dynamics 


Let us investigate more closely the kinematical domain which is probed in these 
deep inelastic experiments. At-high energy, we test the singularities of the com- 
mutator [J,(x), J,(0)]. Singularities occur at short distance (x ~ 0) or for light- 
like separation (x* ~ 0). For the commutator which vanishes outside the light 
cone, short distances in Minkowski space are time-like. To study such a region 
would require us to look in momentum space for large q and small w. The 
physical boundaries are, however, w > 1. In the experimental conditions we can 
write q = An — qo where nis light-like and / large. As 4 «0, —q* ~ 24(n* qo) > 
00, v~ A(n: pp) 00, and w ~ p'n/go'n tends to a finite limit. Qualitatively, we 
expect to probe a region where (n- x) ~ 1/A in such a way that x* ~ O(1/A) ~ 
O(1/ — q”). Thus we are looking at the singular structure of current commutators 
in the neighborhood of the light cone. It is therefore attractive to formulate the 
parton model hypothesis in the so-called infinite momentum frame (Fig. 13-9). 


The dynamical evolution is considered starting from an hyperplane containing a light-like direction 
suited to the case where energies and longitudinal momenta are large and comparable. Without 
entering into details, let us sketch the intuitive reasoning of the parton model. Let P be the magnitude of 
the large longitudinal proton momentum in the center of mass frame, for instance; its four-momentum is 
then approximately p ~ (P,0,0, P). The target is thought of as an assembly of N elementary constituents, 
to be treated during their electromagnetic interactions as free particles with longitudinal momentum 


x;P and negligible transverse momentum with respect to ./—q*. The virtual photon four-momentum 


will be approximated in this frame by q ~(v/2P, ./—q?, 0, —v/2P). The Feynman rules in the 
infinite momentum frame become analogous to those of nonrelativistic perturbation theory. The 
contribution of the ith constituent to the scattering cross section will thus be proportional to 


m 
Qi — O(E’ — E — qo) 
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Po 


D 
= Pr Figure 13-9 Infinite momentum frame. 


with Q; equal to its charge (Fig. 13-10). Since its initial momentum is x;P, we have 


as v q’ v 
ged er ee eee p-x) - Po eee 
ae (x op)  *' 5p” oyPp P 


Therefore the above contribution can also be written 


2 
uo? ™ 3(x+£) 
v 2v 


giving an interpretation of the scaling variable x = m ' as the fraction of total momentum carried 
by a constituent when scattering off the virtual photon. The model is completed by assuming the 
probability for such an occurrence to be f(x) with 


[, ere =1 


0 


For simplicity we have considered the case where f(x) does not depend on the type i of the constituent. 
It then follows that the structure function W2, say, is given by 


y 2 
~ W.0,¢4)= Fa(-£) 
m 2v 


F(x) = YO? xf (x) 


(13-108) 


Figure 13-10 Parton contributions to the structure function. 
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and automatically satisfies scaling. Similarly, mW,(v, q7) is equal to a function F;(— q?/2v). The relation 
between F, and F, depends on the constituent’s spin, being 


2xF (x) = F2(x) (13-109) 


for spin 3, which corresponds to R = 6s/o7 = 0. From the conservation of total energy momentum 
()) x; = 1) we derive the sum rules 


00 d Vidx 
| =e | "Waly, | 2 | ~ Fy(xy= 5 Q? 
o VLm ox 1‘ 


q’ re) 


; (13-110) 
a W2(v, q7) dx F (x) } i 
Faas Sates — y = = — 
(} Vv : od 0 : i N 


2m 


The crucial hypothesis of this approach is the quasi free behavior of partons during the interaction 
with the external current and the neglect of transverse degrees of freedom. 


An equivalent description amounts to substituting free constituent fields in 
(13-101) when estimating the contribution of the hadronic current in the light-cone 
region. The latter will read 


J*(x) = Woody" QW(x) (13-111) 
where w is a fermion free field and Q the charge matrix. Using the anticommutator 
(3-170) 

— i 
(Wd), WW} = 5 Gxe(x? — y°)S[(x — y)?] + OLm?(x — y)?] 


we find 
[FO IM | HOO? rare) — WOO? ywiaT WO) 


a(x?'— 9°) 
paella  ene 

i ee 
When evaluated in a sum over polarizations of a diagonal element, only even 
terms in the <> v exchange contribute. We expand the product of fields in Taylor 
series 


Seba 


= n Be ESS — 
Voi —) = Yee eG, Fh. WO) 
and use 
3(yuYaPv + Y¥aPu) = (GuaIve + JusGva — JuvGap)y* 
to obtain 


: {T(x Jo(—x)] + [Jv Ju(—x)]} 


1 ie 1 
a yy Xu “ Xu, ni OBus M(Guadvp ss Fup Gv a JuvGap) Sn %e(x°)5(x?) 
nodd i Te 


OPH“ = HPO) Q2yF GM GB Hyp(0) (13-112) 


ASYMPTOTIC BEHAVIOR 667 


In the vicinity of x7 = 0 the commutator has been expanded in an infinite series 
of regular local operators multiplying the same c-number distribution. To com- 
pute the tensor W,, in the Bjorken limit we need the matrix elements 


>) ol >, Sq Ob Hl > = dns 1(p*p"':--p' + trace terms) (13-113) 
2 Sol 


The trace terms involving contractions of two indices will not contribute when 
multiplied by a string of coordinates as in (13-112). We have 


roo kfever! soe) 


ws . 
aan 3 ee he 5 DROP Obi “tn D> 


nodd “* 2 bol 


se ) 
X (GuaGvp + GupGva — JuvGap) O* Seely 3) 

We insert (13-113) into this expression and define a function f(x), where x will 
turn out to be the scaling variable (not to be confused with a configuration 
argument), in such a way that 


y (y*p)" rae <4 fa xo LS) (13-114) 


nodd n! 


This means that the matrix elements a,+, are the moments of the distribution 
f(x). Neglecting p? = m? as compared to p- gq, we find 


Wa = | atyer?| acorn 2 


x [pq + xp)y + PAG + XP)y — GuvP* oe £0 55 


Carrying out the integral over y we are led to the expression 


_ F(X) (uv _ Le as )( al 13-115 
Wav = ax [he in)+ * (0, qe Vu v g dv ( ) 


where of course x = w * = —q?/2v. . 
We recognize the results of the parton model with structure functions 
vW. 
+ = F2(x) = mf(x) 
mW, = F,(x) = —— me d (13-116) 


F(x) = 2xF (x) 
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Why is such a simple-minded approximation in good agreement with experiment 
and what are the expected corrections? The answer to these questions is given 
by the existence of an asymptotically free theory. The detailed implications will 
be discussed in Sec. 13-5. 


It is possible to measure the antisymmetric part of the current commutator in experiments with 
polarized leptons and nucleons. Measuring spin in the direction of the incident beam. show that 
do! = dot — A?” 


ee oe Oe y)— (EF — Eves O)(E + Bymetryq’)) 
—q mm 
(13-117) 


where the quantities d(v, q”) and g(v, q*) are defined through 


1 ' 
z| d*x e-9°* (Cp, S|[J,(x), J(0)] |p, S> — <p, —S|[J,(20, Jv(0)] |p, — 5} 


1 
= ray EuvpaG?| — S°d(y, q’) + pS + qg(v, q°)] (13-1174) 


and the nucleon polarization fourvector S satisfies S*p = S? + 1=0. 


13-4-3 Electron-Positron Annihilation 


Electron-positron annihilation into hadrons at very high energies (several GeV) is 
a surprisingly fruitful domain of investigation. The discovery of narrow resonances 
such as the w and y’ and the corresponding spectroscopy will not be discussed 
here because of lack of space. 

The cross-sections are of the order of a few tens of nanobarns (10° 34 cm?) 
and are comparable to the rate of the electromagnetic annihilation e’e >~u7y. 
This process is schematized on Fig. 13-1la. At very high energy where the square 
center of mass energy q? is much larger than the masses it is given by 


d°P, d?P_ e* 
= 2n)* 54 a5 pie 
ei 4 | oarrsegrster (24°) ait OE Pee) 
x } |a(P_)y"v(P+)6(p+)y.u(p_)|? 
pol 
B Ana? ee 80 


1 aes (13-118) 


The hadronic annihilation involves matrix elements of the current J,. For un- 
polarized electrons and positrons, taking into account the hermiticity of the 
current, it assumes the form 


e* 


G 9+ e- _shadrons — 2(42)° (g'"p+ 7 pt p= a p+ p*) 


x 2 aap 5°(ps — p+ — p-)<O|J.(0)| f><F|0)|O> (13-119) 
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Oo 


P, P, 
a : ‘4 
yee a > ‘ =e 


(a) 


-|\>@<| - De 


(c) 


Figure 13-11 (a) Born diagram for e*e~ > p* yx” :(b) the process e* e~ > hadrons; (c) the cross section 
Oe*e~ -shadrons in terms of the hadronic contribution to the vacuum polarization. 


Since q is time-like positive, we can also write 


2 (2n)* O* (pr et = p-)<O|J,(0)| f>< f |J.(0)|0> 
= | d*x ei4°* £0| J,(x)J(0)|0> 
= | d*x e'°* £0|[J,(x), J,(0)] |0> (13-120) 


Annihilation at high energy allows therefore to test the vacuum matrix element 
of a current commutator at short time-like distance. We are led once again to 
investigate the asymptotic domain. However, changing from space-like to time- 
like short distances may be more than an innocent modification. 

The Fourier transform of the commutator in (13-120) is related to the hadronic 
contribution to the forward amplitude through 


| d*x e9°* £0] [J,(x), J(0)] |0> = 2 Im | d*x e9°* (0| TI, (x)J s0)]0>| (icon 


(Fig. 13-11c). This is also the hadronic part of the vacuum polarization tensor, 
up to a factor e?: 


‘| d*x ei * 0] TI(x)J(0)|0> = (Garg? — Gug@"(q?) (13-122) 


Thus 


abn v4 a? 
Oe hadrons — 


Im &"(q?) (13-123) 
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Note that Dot em ayy has the same structure with Im ©" replaced by the muon-loop contribution 
} 


(7-11) 


ie 4m? \#/2 2m? 1 
- »_ 1 4(,_ Amn a ies 13-124) 
Im d,+,-(@°) e? (1 i 7 q | mijq?oo 12n 
Therefore 
162202 7 4nc? 
et em opty = qe Im tpn 4 )= 3q? 


in agreement with (13-118). 


It is also traditional to call R the ratio of hadronic to electromagnetic 
annihilation cross sections: 


R(q’) = O + e~ shadrons = 12xIm o"(q’) (13-125) 
Fete uty 
An unsophisticated application of the parton model yields for the limiting 
value of this ratio 


lim R(q?) = Ra =), Q? (13-126) 
q-7 a i 


by simply adding the contributions of the lowest-order interactions of the charged 
spin + elementary constituents. 

Models for the internal symmetries of hadronic states yield different values 
for -R,. according to the number, type, and charges of the constituents. For the 
octet model of quarks with charges 3, —4, and —4 and three color indices, the 
predicted value is R,, = 2. If additional quantum numbers are present this ratio 
increases. For instance, a charmed quark of charge 3 would contribute an extra 


E(GeV) 


Figure 13-12 The ratio R = o¢'¢ shadrons/Ge*e" +" ,~ aS a function of the total center of mass energy 


p= vie measured at SLAC. The data were compiled by R. F. Schwitters and K. Strauch, Ann. Rev. 
Nucl. Sci., vol. 26, p. 89, 1976. 
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quantity 3, leading to R,, =+2. A massive lepton around 2 GeV produced in 
pairs and not distinguished from the hadronic states increases again R,, by one 
unit. In any case the experimental results as of 1976 are shown in Fig. 13-12, 
and support the idea that R(q’) might stabilize around a value compatible with 
these predictions. A new generation of experiments might provide surprises. 

The asymptotic property (13-126) may equivalently be stated by assuming a 
free-field behavior for the vacuum matrix element close to x = 0: 


Ris I 
<0|J,(x)J,(0)|0> ne Dn* (6,0, a Gualal) 


~0 (x? = iexy)? 


(13-127) 


We can guess the kind of corrections to R,, which follows from an asymptotically 
free-field theoretic model. From the fact that conserved currents are not renor- 
malized (see below), we expect that for large q* the function R is approximately 
given by 


, » 2 a 3 ‘ ; 
RVC grr ae] Rx + gt Ty + | (13-128) 
where y(A) is the running strong coupling constant. The value indicated in (13-128) 
uses the two-loop calculation of the vacuum polarization [ Eq. (13-13)] to estimate 
the term in g*(A) of Im @". The internal fermion quantum numbers are responsible 
for the trace factor T, and R,. is given by (13-126). For large 4, g*(A) is given by 


3(4n)? 
(11C — 4T;) In A? 
according to Eqs. (13-81) and (13-90). Such a correction predicts that R.. is slowly 
approached from above [as (In q*) *]. In an ordinary perturbative expansion the 


dominant contribution would be obtained by expanding (13-124) in powers of 
m?/q?, with the result 


m2 \2 
: R(q?) ~ Ro I = 9(™ a | (13-130) 
q-~7@ 


i.e., a negative and fast varying correction. 


g(a) ~ (13-129) 


We have poorly mistreated the beautiful and numerous experimental results on deep inelastic 
phenomena. Our aim was simply to illustrate an area of particle physics where modern ideas allow 
quantitative predictions which may be severely tested. For nonasymptotically free theories, such as 
quantum electrodynamics, little is known of the short-distance behavior. 


13-5 OPERATOR PRODUCT EXPANSIONS 


The preceding discussion illustrates the interest of studying the behavior of 
matrix elements of products of composite operators in several well-defined limits : 


1. The space-like separation tends to zero (euclidean case). 
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2. The time-like separation tends to zero (minkowskian case). 
3. The square separation tends to zero (light-like limit). 


Problems 2 and 3 are specific of particle physics. In general, the operators 
involved are conserved or partially conserved currents. Problem 1 is the one that 
can be analyzed most thoroughly. The results can be directly applied in statistical 
mechanics. Crucial contributions were made by Wilson and Zimmermann. 


13-5-1 Short-Distance Expansion 


Consider the product of two local operators. For simplicity, we indicate only 
their dependence upon the configuration variable. Wilson has suggested a short- 
distance expansion of the form 


x+ 
A(x)B(y) ~ )) Cu(x — 0v( : ) (13-131) 
x7>y N 
The Oy are a sequence of local regular operators, while the c-number coefficients 
Cy(x — y) are singular in the limit x > y. Perturbatively their behavior is dictated 
by the canonical dimension of the corresponding operators up to logarithms 


lim Cy(x) ~ x?*(mod In |x) 


. (13-132) 
yn = do, —da— dp 
The higher the dimension of Oy the faster the Cy go to zero. Of course, these 
notions will be somehow modified by renormalization group effects. 

When dealing with the euclidean theory the name operator is somehow 
abusive. We have in mind the possibility of constructing generalized Green 
functions G')s(x, y;21,...,Zn) such that the n arguments 2,,...,z, refer to funda- 
mental fields and the remaining two to A and B. The meaning of Eq. (13-131) 
is that when ordering the Oy according to their dimension 


x+y 
lim {] oz ys hye en) io y ere aie y)GS (2 rZhyrees | 


xy 
x Cy, (x — »| =0 (13-133) 


The Wilson expansion is characterized by the fact that the singularities 
generated in the limit x— y are given by the c-number coefficients Cy(x — y) 
independently of the arguments and types of elementary field appearing in Green 
functions. In Minkowski space Eq. (13-133) is understood as an asymptotic 
series in the weak sense of matrix elements between physical states. Clearly this 
is a generalization from the case considered previously, when all separations 
were tending simultaneously to zero. We shall see that it is possible to write 
for the coefficients Cy renormalization group equations. 

Instead of giving a cumbersome general proof, we shall satisfy ourselves with 
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a simple example from a scalar field theory. The operators A and B are taken 
as elementary fields and we look for the dominant coefficient 


PNP(W— C(x — ye (: 5 | Olea y)* | (13-134) 


We expect from (13-132) that C(x — y) behaves at a given order as a polynomial 
in In|x — y|, up to terms of order (x — y)’. To prove this, we consider the two 
sets of conneeted Green's functions G'"*?)(g,, q2, p1,-.., Pn) and GI(q1 + q; 
P1,---,Pn) and study the additional subtractions required in the construction of 
G*: as compared to those already necessary to define G'"*”). An example in- 
volving G and G‘) is shown in Fig. 13-13. 

A Feynman integrand relative to G'j!} may be obtained by identifying two 
external vertices pertaining to G'"**) as a unique one and adjusting the sym- 
metry coefficient. The subtractions to be performed are, however, different. For 
the sake of renormalization, it suffices to consider one-particle irreducible 
functions I'¥?. However, the two external lines to be contracted in G”*?) need not 
be truncated. Let G"*?'(qgy,q2;p1,---,Pn) denote the one-particle irreducible 
(n + 2)-point function with complete propagators on the external lines of momen- 
tum q, and q>. Let #,: and # be the renormalized integrands of I? and G"*”). 
They are related through 


Ry =A+VK (13-135) 


where .Y comes from the subtractions on the renormalization parts containing 
the vertex v of the y? operator. The latter are irreducible Green functions with 
two @ external lines; they are of superficial degree zero if y* is considered of 
dimension two. Following Chap. 8, denote by ¥ the forests of renormalization 
parts, one of them containing v. If t is the smallest renormalization part of 7 


containing the vertex v, we write 
F=aF,UtVFy (13-136) 


xu 


with F, the set of renormalization parts included in t and ; those containing t 
or disjoint with it. Let I be the unsubtracted integrand common to Gi and G"*”), 


a6 OU 


Cf me eee ese <() ae 


Figure 13-13 The connected Green functions G and G{?. 
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the explicit expression for Y reads 


ta YD i (ra f-7] || ro (13-137) 
t,F,F, LyeF, yeF, 


We can organize the summation as follows. We sum over the renormalization 
part t including v, the forests 7, of t excluding t itself, and over the forests 7; 
of the reduced diagram G/t. The latter is a possible diagram for a function 
l. Let o be the part of the diagram pertaining to G'"* 7) corresponding to t. 
It is an irreducible four-point function except for the propagators and possible 
self-energy insertions on the two lines which join at v in t. We call it ree, 

For each t the integrand / is factorized in J = Jg-/,. In an intermediate 
renormalization, the operators T’ represent the Taylor series at zero momentum. 
The forests .#%2 are those of o, 4%; those of G/t, and T* amounts to restricting 
A(c) to its constant term #o(a). This operation is illustrated on Fig. 13-14 where 
one shows a contribution to '“’ with the two-lines candidate to be joined at r. 
The operation T‘ sets the external momenta equal to zero except for the would- 
be circulating momentum k in t. Translated into symbols this means 


S = = RyAG/t)Ro(o) (13-138) 


so that, from Eq. (13-135), s 
R= Agr + Y ReAG/t)Ro(e) (13-139) 


The renormalized integrand for G'""*' is expressed in a way showing the sub- 
tractions implying v which are, of course. compensated by the second term on 
the right-hand side. The notation #,: is a little ambiguous since it still involves 
n+2 arguments and is relative to a Green function which can be denoted 
<0] TLe(x)e(y)]2G(21)°+* P(Zn)|O>. 

After integration over loop momenta and summation over diagrams we may 
write the Green functions as 


Gt )(x,-y, 21,...,2n) = 0] TO(X)G(y) (21) --- P(zn)|0> 
e a*q2 || d*p, 


= » Ona (21)* 5° (0 + G2 + Da pe] 


G 


Pes | ence 
SO iia | Lame ye AG PKG) (13-1404) 


We define the quantities 
0] TLo(x)@G)]2 G(21)°- G(Zn)|0> = idem with 2 > By: (13-140b) 
<0| TL 9?(20)]2 P(21)- + (2,)|0> = idem with A+ By: and x=y (13-140) 
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k Ia, -4, 0 


=e 
=} 
—k+4q, +4, 0 


Figure 13-14 Zero-momentum subtractions on f'™. 


The notation [ ], is to remind us that dimension two is attributed to the operator 
between brackets. For notational simplicity we have used the minkowskian time- 
ordering symbol. Setting 


~(0) = [ats ~(z) (13-141) 


for the zero-momentum Fourier transform, we conclude from the identity (13-139) 
that 


0] Te eOI(Z1)"** G(En)|O> = <O| TLe(x)G(y)]29(21)--* G(z,) 0 
1 = > eee 
+5 o|ro( 5 *) o(? : “J 6s0]0y 


x <0| T| 9° (=22)| P(Z1)*** P(Zn)|O> 


(13-142) 


The index P recalls that the quantity in question is obtained from the irreducible 
I by adding complete propagators on two external lines. The reader might 
find it useful to try out the above operations on an example in order to be 
convinced of the presence of the symmetry factor 3 in (13-142). 

This algebraic construction is well suited to study the limit x — y. Indeed, 
renormalization theory implies the convergence of the subtracted integral. In 
particular, Green functions involving [@(x)g(y)]2 tend order by order to those 
of [g7(x)]2 up to corrections to the type (x — y)*(In|x — y|)*, in the euclidean 
region at least. Consequently, Eq. (13-142) enables us to isolate the most singular 
part of G“"*” in the form 


<0| Te(x)G(y)G(21)*** G(Zn)|0> a C(x — y)<0| T| 6° Gall (21) P(Zn)|0> 
(13-143) 
1 v= a 
C(x — y= 1+ 5 co\To(2 5 ») o(**)aearrior 


To each order C(x) behaves as a polynomial in In|x|. For instance, to lowest 
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order C(x) is proportional to 


dapanes 1 1 
C(x) ta | (2x)* e (q? of m?)? On (2n)3 Ko(m|x|) 
1 m| x| a | (13-144) 


One can refine the above derivation in order to exhibit the successive terms in the 
Wilson expansion. We shall skip this tedious task and, assuming the result, will 
study the consequences of the renormalization group on the coefficient C. Accord- 
ing to Eq. (13-143) this implies the analysis of a Green function at exceptional 
momenta. 

From the above example it is clear that C(x) might contain subdominant 
terms in its perturbative behavior. We have therefore the choice to follow either 
the original Callan-Symanzik method with zero-momentum mass insertion or the 
Weinberg approach of mass-independent normalization conditions. We take for 
definiteness the first option and call Co(x) the asymptotic part of C(x) obtained 
by dropping perturbative subdominant terms. We want to show that Co(x) 
satisfies an equation of the form 


7) 0 
E 3x + BY) ag + 2ye(g) — a0) | Co(x) = 0 (GE ics) 
The connected functions obey 
7) om a r) 
po eee 4 GIERZ) ae ) 
* rn ae ma B(g) aa qe (iea= 25) We (a) Ge" ae) 


= GE* (x,y, z_) (13-146) 


We have absorbed the factor 2[1 + 6(g)] in the definition of A. 
We substitute Wilson’s expansion when x > y and keep only the dominant 
term. In this limit the left-hand side reads 


0 0 ; 
{| x—y)- GEE a Big) a 2ye(9) — sao) |Cols = nh Gy: = | 
f) é é é 
+Cole neg aoe a set Pas, +n +2 + nyoe(g) + (0) 


x 6? G2 a us 2.) 


A Callan-Symanzik equation also holds for Ga of the form 
0 0 6 he 
a + Bg) = g 1 Tt A Mye(g) + ya(g) Gaz (X, Za) 


= Ge yz) (131g) 
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The desired result will follow, provided that in the limit x > y the right-hand 
side of Eq. (13-146) can be identified with C)G” 


A,g2* 
n n 28 4F 
Ga eae, 2, Colx - 6 ( 5 Z 2) (13-148) 


That this is true is a consequence of the same analysis as the one sketched 
above, generalized to Green functions involving a mass insertion A. The latter is 
outside the renormalization part t which contributes to C(x — y) since the only 
divergent function with two ¢” insertions is a vacuum-to-vacuum amplitude. We 
conclude that (13-148) holds and we have therefore proved that Co(x) obeys the 
renormalization group equation (13-145). 

All this may be extended to the successive terms of the operator expansion. 
Returning to a general product A(x)B(y) we see that Eq. (13-132) must be cor- 
rected to read 


E . = + Big) = + d4+yalg) + ds + ya(g) — do, — joa) Cw =0 (13-149) 


with d4, dp, do, the canonical dimensions and jy 4, ys, Yo, the anomalous dimen- 
sions of the operators assumed to be multiplicatively renormalizable. 

Strictly speaking, Eqs. (13-145) and (13-149) are only valid in the euclidean 
domain. They hold in the Minkowski case provided Feynman’s ie is kept finite. 
The genuine minkowskian limit as implied in e* e~ annihilation requires a careful 
discussion of possible oscillations at large momenta. 


Wilson's expansion is only established in the weak sense. For instance, it holds for each function 
Gh with n elementary fields ~ and the operators A and B. It is not guaranteed that (13-131) can 
be naively applied to other Green functions containing extra composite operators. For instance, it is 
not true that 

x 


* lim (0| Te(x)e(y)~7(z)|0> —isequalto = C(x — y)<0| Ty? (“=*) y*(z)|0> 


In this case, since the global function is primitively divergent there will exist new contributions to 
the short-distance behavior apart from those involving two fields g generating the coefficient C(x — y). 
We must also account for the subtraction of the complete diagrain, which will add a new function 
C(x — y) independent of z: 


lim <0| Te(x)(y)y?(z)|0> = C(x — y) <0] Tg? ZX )oraio +€(x—y) (13-150) 
xy 
The conclusions to be drawn from the expansion depend on the non- 


perturbative existence or nonexistence of ultraviolet fixed points. If such a point 
exists we recover the results of a modified dimensional analysis with 


Cy(x) ~ |x| dont Pon(Ga)—da— Valder) 4a 79.0) (13-151) 


In an asymptotically free theory where the functions )(g) are of order g* we 
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write, as in Sec. 13-3, 
B(g) = —bg? +-° 


- ar (13-152) 
Vg) = cg 


Integration of (13-149) yields 
1 \(cates— Con)/2b 


and therefore predicts logarithmic deviations from canonical scaling. 

To apply the above techniques to the concrete examples we have to specify 
the relevant operators, study their conservation laws, and extend the analysis to 
light-like separations. 


13-5-2 Dominant and Subdominant Operators, Operator Mixing, 
and Conservation Laws 


In the simple example of the product y(x)p(y) the operators Oy of the expansion 
are local monomials of the fields and their derivatives compatible with the sym- 
metry properties. For instance, the next subdominant terms in a theory invariant 
under the change of g into —@ are of canonical dimension four and read 


x 


o(S)o(- )= Co(x)92(0) + CH(xV(@@P (0) + C2 re e(0) 


ei) oO) (13-154) 


The notation C2(x) stands here for what was previously called C(x). The number 
of operators grows with the canonical dimension. Even when dealing with the first 
subdominant terms new difficulties arise. We recall that renormalization mixes 
operators of the same canonical dimension and the same quantum numbers. 
Moreover, renormalization is not exactly multiplicative since the insertion of an 
operator of dimension d necessitates counterterms of dimension smaller or equal 
to d. Special conventions are required to disentangle m?x* corrections to C(x) 
from contributions arising in C{(x). In a massless theory, where such problems 
do not arise, we have only to deal with a multiplicative matrix renormalization 
of these operators. The dominant behavior of the C{? is therefore governed by 
the equation 


0 i) . hy 
|: a Bg) ag + dy + yalg) + de + ya(g) — ay fo = advo Cy (x) = 0 
(13-155) 
where (X)"Q) = y\(g) is the transposed anomalous dimension matrix of the 


operators Of) mixed under renormalization [see Eq. (13-75)]. In the case of an 


ultraviolet fixed point g., a diagonalization of yN(9o) will give the observed 
anomalous dimensions. A similar diagonalization is necessary for the matrix c’¥ 


generalizing the constant c in Eq. (13-152) for an asymptotically free theory. 
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ft may happen that there exist relations among the operators Oy as a consequence of a specific 
dynamical scheme. As an example this is the case for the equations of motion. 


The bare connected Green functions of the regularized y* theory satisfy in euclidean space the 
equations of motion 


. go 6 : G 
(= 2 a ti jx) |e) = 0 (13-156) 


é 
dj(x) 3! dj(x)? 
with (— () + mg), the inverse of the regularized propagator. Take a derivative with respect to j(y) 
and let v + y. We obtain an identity which translated on the proper functions reads 


Go 6 
Foi D+ myvotard * 31 Votan = 90) 5565 (13-157) 


The renormalized version of this relation mixes operators of dimensions two and four. As a con- 
sequence there exist functions a;(g) and b(g) such that we have, identically, 


8s (QU yry9 (95 Pa) + 2(Q)EO2(q5 Pa) + a3(g)T (3 2(43 Pa) + B(g)E er 4245 Pa) 


= P'")(pies.-.Da + Gs-+++Pn) (13-158) 
=1 


a 


In the massless theory, giving the dominant behavior the contribution proportional to b will 
vanish. From the Callan-Symanzik equations relative to the two sides of (13-158) we conclude that 
the matrix y'/ pertaining to operators of dimension four must have a zero eigenvalue. 


A similar phenomenon occurs when one of the operators Oy (or a combina- 
tion of the Oy) is the generator of a continuous symmetry, conserved current, 
energy momentum tensor, etc. The corresponding anomalous dimension vanishes. 
We have already encountered this case in electrodynamics, the consequence of 
which was the appearance of a unique f coefficient in Eq. (13-28) for the invariant 
charge. A more general situation was encountered in the o model when a sym- 
metry is softly broken, 1.e., by terms of dimension d less than four in the lagrangian. 
We recall the result of Symanzik showing that the counterterms of dimension 
higher than d could be kept symmetric. In particular, wave-function renormaliza- 
tion is symmetric. 

Under such circumstances let J$ be the bare current (of dimension three) 
and Do its divergence. The latter is of dimension smaller than four by hypothesis 
eea Haat ti-3)]: 


6,Jk = Do 
d(x° — y°)[J8(x), Po(y)] 


with @o the bare field being a vector in the internal space and t the matrix 
_ representative of the generator. The Ward identity 


d* <O| TI6(x)Po(y1) *** Po(Yn)|O? 
= (0| TDo(x)o(y1) ** * Po(yn)|0> 


(13-159) 


TPo(x)O*(x — y) 


a 3 <0 | T@o(¥1)°** TPo(Va) *** Pon) | 0> 64(x — ya) (13-160) 
a=1 
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becomes in renormalized form 
ZT ORK 0 | TJmaeotan) -- aalgr) | 
= Zp'Z"? £0| TD(x)e(y1) °° * Pn) | 0? 


+2" X60] Tolys) + ove) ** Pn) |07 8x — Ya) (13-161) 


if we assume no anomaly. In writing (13-161) we have taken into account the 
fact that wave-function renormalization is symmetric, 1e., independent of the 
component of ~. Since the renormalized functions are finite it follows that Z) = 
Zp is finite and a proper normalization consistent with (13-159) yields 


LZ (13-162) 


As a consequence, exact or softly broken symmetries correspond to currents for 
which 


y= y=9 (13-163) 


For the divergence D to have a dimension smaller than four it must contain an 
explicit dependence on the massive parameters of the theory. In a fermionic 
theory, for instance, where chiral invariance is broken by a mass term of dimension 
three, the axial current conservation is softly broken. In the absence of anomalies 


OlWyty W) = Wipy sy (13-164) 


With a mass-independent renormalization, m depends on the dilatation factor 7 
as in Eq. (13-96). Let y, be the anomalous dimension of the operator yp. 
According to (13-163) we shall have 


a a 2 
4 oi Daa bp. + Ym(g)m ie” Blas + 4 pipalg) 


-Y idee role sy(4s Pa) =0 (13-165) 


The explicit dependence on m has contributed an extra term ),, to be interpreted 
as the anomalous dimension of iy>W, equal therefore to the one of wy: 


Tas ~ My 
This result is not surprising since both anomalous dimensions may be com- 
puted in the chiral limit where they obviously coincide. 


Let us apply these ideas to hadronic symmetries and to their relation with the effective lagrangian 
of weak nonleptonic interactions. 

Consider a model for strong, electromagnetic, and weak interactions based on a gauge theory 
for a product group Gs @ Gw along the lines sketched at the end of Chap. 12. The group Gs will 
typically be the color group [for instance, SU(3)] and Gy will be spontaneously broken down to a 
U(1)9 phase group for the electric charge. 


ASYMPTOTIC BEHAVIOR 681 
} ~-@ 


(a) (b) (c) 


Figure 13-15 Lowest-order contributions to the effective nonleptonic lagrangian: (a) gauge boson, 
(b) Higgs meson, (c) tadpole term. 


To order zero in « and Gr ~ o/Mj the strong interaction lagrangian will read 
£5 = iWDsw — WM — $F 5.,yF 


where M is a mass matrix, the origin of which may be related in part or totally to the breaking of 
Gy. Weinberg has shown that suitable redefinitions of the fields always allow to bring M to a 
diagonal real form without any +5 factor, while keeping the kinetic term invariant. In other words, 
to order zero in x, parity and strangeness are naturally conserved while isospin symmetry requires 
the additional hypothesis M, = M, for the d and u quarks. Let us show that parity and strangeness 
are not violated to order «, but only to order Gr ~ a/M?. 

In order to verify this point we introduce an effective lagrangian for weak nonleptonic inter- 
actions computed from lowest-order exchanges (Fig. 13-15): 


¥ 


f= 4 | d*x A*(x,m?)<TJ,(x)Jy(0)> + Loy + Lo (13-166) 
Only the first term corresponding to gauge bosons has been written. The two additional terms 
represent Higgs-meson exchange &,) and the vacuum expectation value of Higgs bosons leading to a 
renormalization of the mass matrix M. We expect the contribution of Higgs bosons to be of order 
a(m?/M},) at least, with m a typical hadronic mass. 

As regards the first term, apart from the massless photon contribution, the remaining part 
involving the propagation of heavy W and Z mesons may be analyzed using the short-distance 
expansion of the product of two currents. The dominant terms of order « ~ gy lead to an additional 
renormalization of the mass matrix while subdominant terms contain factors «/Mj# ~ G. The matrix 
M + 6M may again be brought to a diagonal real form. In short, to order x, parity and strangeness 
are still naturally preserved. This is not the case for isospin invariance. But even to this day attempts 
at predicting in an absolute way the supposed electromagnetic mass differences have remained without 
real success. 

The same short-distance expansion may be used to understand the dynamical selection rules 
observed in nonleptonic weak transitions, suchas the rule AJ = $,|AS| = 1. The dominant contribution 
arises from dimension-six operators involving four fermion fields. Indeed, operators of dimension three 
only contribute to a redefinition of the mass matrix and those of dimension four may be absorbed in 
a wave-function renormalization. Calculations by Gaillard and Lee and by Altarelli and Maiani 
favour a AI =4 enhancement with respect to the AJ =# transitions by a logarithmic factor 
[In(Mj/m7)]> of the order 5 to 7 according to the models, whereas the observed enhancement factor 
is rather of order 20. The discussion is made difficult by the lack of absolute normalization of the 
_ matrix elements of the operators involved. 


13-5-3 Light-Cone Expansion 


In order to analyze the corrections to the parton model of deep inelastic phenomena 
we have to extend the short-distance expansion. The type of generalization required 
to go over to light-like separations may be anticipated from the free-field case, 
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as in Eq. (13-112). Omitting the indices carried by the currents, we would like to 
show the validity of an asymptotic series 


F{= \a{ — = Vn, Gaile aes eg OnlO) (195167) 
p) D x7 +0 Na ‘ se 

in terms of operators O%'.'“” symmetric and traceless in the Lorentz indices. 
Perturbatively and up to logarithms, we expect the coefficients Cy_.(x) to scale as 


OC eles a ma (13-168) 


with d,; the canonical dimension of J. 

In contradistinction to the previous case, an infinite number of terms con- 
tribute to a given behavior near the light cone, in particular to the dominant one. 
The grouping of terms is made according to what Gross and Treiman have 
called twist, i.e., the difference 


A = do,,— N (13-169) 


between the dimension of the operator and its spin. Strictly speaking the latter 
characterizes the corresponding representation of the homogeneous Lorentz 
group. The fact that we need such an infinity of terms is welcome since the 
matrix elements of this product of currents must give a scaling function F(x). 
The knowledge of such a function is equivalent to an infinite sequence of numbers, 
its moments for instance. When the structure functions of lepton-hadron collisions 
are described as absorptive parts of Compton amplitudes the integer N will be 
an upper bound on the spin in the exchange t channel. 

The leading contribution arises from operators of lowest twist. For a theory 
with spin + fermions, and scalar and gauge bosons this value is two for the 
diagonal matrix elements of electromagnetic currents. These operators are bilinear 
in the fields up to covariant derivatives. They read 


bad —> 
iNo* Da D,, 
N-17, ZF SZ 
LTD 8 malay 2) (13-170) 


N—2 —> Ke . 
: Puy Dy,*** Duy _,F Hy 


Symmetrization and extraction of traces is understood. We have only mentioned 
“physical” fields in this list. For specific subtleties of gauge theories, see some 
remarks below. 

To justify Eq. (13-167) we relate it to the short-distance expansion by pro- 
jecting out a given spin exchange of the crossed Compton amplitude. This quantity 
depends only on the variable q*, the square virtual photon momentum, and is 
dominated in the limit —q’— co by the operator of corresponding spin and 
twist two in the Wilson expansion. This gives a definite information on the 
moments of the structure functions. 

For short we continue to ignore the vector character of the currents and 
the subsequent tensor analysis of the structure functions; we therefore pretend 
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that we deal with the scalar amplitude 
2 | 44. nig>x ci ae 
Aka’. vy= i \ dx e's Galt = J{—- A |p» (13-171) 


For finite g’, v it can be expanded on a basis of orthogonal polynomials in the variable z — ivim./ —q? 
which play the role of a cosine of the angle between the four-vectors p and q. These polynomials 


are orthogonal with respect to the measure dz, 1 — <* and generalize to O(4) the Legendre poly- 
nomials relative to O(3) invarianee. We refer to the work of Nachtmann quoted in the notes for 
details on this projection and the constraints arising from the positivity properties of the structure 
functions. We shall satisfy ourselves here with a simplified presentation. 


Inserting (13-167) into the definition (13-171) we find 
A(q?, v) = | Geaee* y Cx? — ie)x,,°° Ne <p| Our Hy \p> 
Nya ’ 


Some care has been paid to the analytic properties in x space to give an infinitesimal 
negative imaginary part to the variable x”. Define 


<p| O44 | p> = an,a(p' ++ p4* + trace terms) 


3 \N (13-172) 
Crag i(- ig? ) [asx et *Cy (x? — is) 
in terms of which the dominant term in A(q’, v) is given by 
by he 
A(q’,v)~ » ( E | An aC v,a(97) (13-173) 
Nya a 


The contributions of the operators Oy, are related to the coefficients of the 
Taylor series of A in powers of the scaling variable w = x~* = 2p:q/—4q’, while 
experiments measure the absorptive part of A for w larger than one. It is therefore 
necessary to isolate the coefficient of w’ in order to use the short-distance 
expansion to study Cy.,(q7) for large negative g?. In this discussion the variable 
q@ rather than its inverse x is more convenient. 

Since the point w = 0 is outside the experimental reach an analytic continua- 
tion cannot be avoided. This is expressed through a forward dispersion relation 
for the virtual Compton amplitude A, using as discontinuity the structure function 
W(q’, v). The variable in this dispersion relation is the energy v/m, or what 
amounts to the same for fixed q’, the quantity w. It is also necessary to use the 
crossing properties to define the discontinuity for w smaller than — 1. Ina fictitious 
scalar case as here W(q, p) = —W(—gq, p) or W(q?, —w) = —W(q’, —«). In the 
realistic vector case W,,,(q, p) = — W.,(—q, p), which means that all three structure 
functions W,, W2, W3 may be considered as odd in o. 

Further information must be provided concerning the number of subtractions 
k, uniformly for large negative gq’. With Im A = nW, and P,-, being a polynomial 
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of degree k — 1 in w with q* dependent coefficients, we find 


dw’ [(w\* ; 
A(q?, @) = Py-1(q?, @) + | oe eae (2) W(q?, ') (13-174) 
Jw'yjot — 
Expanding near the origin this is also 
day’ 
A(q’, @) = P,-1(q?,0) + ¥ o% | Friel W(q?,o) (13-175) 
N2>k oe 


For N > k we have obtained the desired relation between the moments My(q’) 
of the structure function and the Wilson coefficients 


dw 
M™(q’) -| ower Wg", 0) _ ~ 


jolz1 7 


Y GueGemda?) (42 06) 


For fixed N the leading terms on the right-hand side correspond to twist two 
operators. Positivity conditions on the W result in convexity properties of the 
moments in the variable N. The smaller the N the more sensitive are the M‘”) 
to the asymptotic region since for large N we test mostly the region w ~ 1. 
According to various hypothesis we have the following possibilities for 
M(q?) for large q?: 
constant naive scaling 


M™(q?) ~ { (—q?)~?(92/2 nontrivial fixed point 


(In — g2)-°o./2 asymptotic freedom (13-177) 


The last case seems to be favored by the experimental findings. 


The reader may wonder whether the observed scaling is not consistent with the existence of a non- 
trivial fixed point g.., provided that all yo,(g..) vanish. However, this would imply that the anomalous 
dimension of the fundamental field itself vanishes : 


Yo,Go)=0  VWN=>y(9.)=0 


Together with positivity this entails that the theory is free. Therefore we are left with asymptotic 
freedom as the only possibility. 

We return briefly to gauge theories required for asymptotic freedom in order to survey some 
specific intricacies arising as usual in this case. Products of physically observable operators, and hence 
gauge invariant, can however involve unphysical ones, in particular ghost fields, in their short- 
distance expansion. Such operators occur in the counterterms to Green functions and are therefore 
indispensible to compute physical anomalous dimensions. These additional operators are charac- 
terized using the method of Ward identities developed in Chap. 12 for Green functions without 
insertions. The result of Eq. (12-163) is generalized as follows. Using the same notations as in this 
chapter, a gauge-invariant operator O of dimension d generates counterterms of dimension smaller 
or equal to d with the same quantum numbers as QO, which are either gauge invariant or of the 
form oO’. The second class of operators is stable under renormalization since o2 = 0. Consequently, 
we can organize a computation of anomalous dimensions in a basis of the form {Oi,,,c0’4}. The 


: nv? 
renormalization matrix is then upper triangular by blocks. Only the submatrix in the subspace O! 


. . : . Inv 
enters the calculation of gauge-invariant physical anomalous dimensions. In favorable cases some 
arguments allow us to simplify the analysis by computing directly this submatrix. Finally, physical 
matrix elements of gauge-invariant operators, and their anomalous dimensions, do not depend on the 


gauge parameter. The term in (0/04) introduced in Eq. (13-74) may therefore be dropped. 
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We close this discussion with a summary of results obtained in applying 
gauge theories of strong interactions to leptoproduction. They were derived by 
Georgi and Politzer, on the one hand, and Gross and Wilczek, on the other. 
For electro(or muon)-production, the previous analysis applies to the structure 
functions mW, and vW,;mx, denoted collectively f,(q°, x), and to their moments 


1 
M®)(q?) = \ dx x"~1f,(q?, x) (13-178) 
) 


In that case, the last two series of operators in (13-170) contribute. We have 
to face the problem of mixing of the two types at fixed N. Since we deal with an 
asymptotically free theory, naive scaling is violated by computable powers of 
logarithms 


M™(q?) ~ ¥ Gya(in —q?)~ 4 ee 


where the sum runs over the eigenvector operators of the renormalization matrix. 
In general, the coefficients ay in (13-179) remain unknown. 

The lowest moment N = 2 is the most easily handled. It implies the con- 
served energy momentum tensor T,,, of zero anomalous dimension, together with 
another subleading operator. Moreover, the diagonal matrix element of T,,, 
between proton states is known: 


<p| Ty» | p> = 2pyuPv 


This results in the following sum rules: 


4 1 W. 7) ; 
lim \ dx x[2mW,(q?, x)] = lim \ dx [mae = 8 (13-180) 
q 


—g?>o fa) a 200 6) m 


where u is given in terms of the average square charge <Q?» of fermion con- 
stituents through 


Ty <Q?» 


a 13-181 
2Cr>+ Ty 1+ 2(N? — 1)/nf ( 


w= <0?» 


The last expression applies to f flavor multiplets of fermions belonging to the 
same representation of dimension n of the gauge group SU(N). For instance, for 
the three triplets of SU(3) with charges 3, —4, —4, the value of a Is #5, with 
four triplets of charges 4, —4, —4, 3, a= @,~ 0.12. In the free parton model, 
the corresponding value was a = <Q?» from (13-110). In the asymptotically free 
model, the reduction of a is due to the fact that part of the encrgy momentum 
is carried by neutral gluons. . | 
For N larger or equal to four, it is a reasonable approximation to retain 
only the eigenoperator of lowest anomalous dimension. For large N, Ay behaves 
as In N. 
Although it is hopeless to reconstruct the structure functions from their 
moments without further information. some results may be derived from positivity 
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and from the behavior (13-179). For instance, we may write, for any N, 


1 xte y ial - 1 “ad Vid 2 
fipsv<t| w(?) fay! | (2) fGey) 


1 
< constant y= (In —q’?) 4 (13-182) 


Since Ay ~ In N as N > m, we expect f to decrease for fixed x faster than any 
power of In —q*. This property, together with the sum rule (13-180), implies 
that as — g? grows the structure function has to increase in the vicinity of x = 0. 

The asymptotic regime for the moments is slowly reached since subdominant 
terms are only suppressed by factors of order In [In(—gq?’u7)]/In(—q?/p7). This 
results from the two-loop contribution to the effective charge [Eq. (13-90)]. The 
rate of approach depends on the unknown scale for which the effective charge 
becomes small. Moreover, for finite g*, mass effects may be sizable. 

On the other hand, the moments are not easy to obtain from the experimental 
data, since they involve measurements at very large energy (small x), and for 
any finite q* the change of the x variable into x’ = x + O(1,q?) may somehow 
modify the results. 

In spite of these difficulties, the comparison between theory and experiment 
seems to encounter a reasonable success. It is convenient to rewrite the effective 
coupling of Eq. (13-129) in terms of a single parameter A which sets the scale 


Pie 4 19 
GG) _ = (13-183) 


4n (33 — 2f) In(—q?/A’) 


0.10 


1 2 4 8 15 30 
lq? |(GeV2) 


Figure 13-16 The theoretical prediction for the second moment of the muon-production structure 
function compared with the experimental data of H. L. Anderson et al., Phys. Rev. Lett., ser. B, vol. 38, 
p. 1450, 1977. This drawing and the one of Fig. 13-17 were communicated by G. Altarelli. 
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in the case of the color group SU(3) (C = 3) and for f triplets of quarks (f flavors). 
For f = 4, the best fits are obtained for A ~ 400 + 200 MeV, as of 1977. In Fig. 
13-16, the experimental data for the muon-production moment M®? are com- 
pared with the theoretical prediction. The arrow shows the limit (13-180) (a= 
0.12 for f = 4). For higher moments (Fig. 13-17) only the ratio M“(q*)/M“)(q) 
may be compared with experiment. 

For inelastic neutrino scattering a similar analysis applies to the light-cone 


M(-q?)/MM(-q? = 9) 
i) 


lq? |(GeV?) 


Mog MO GG = 9) 
tv 


= 
lq? |(GeV?) 


5 . = 4 
Figure 13-17 Fourth and sixth moments for muon production, normalized at —q? = 9 GeV’. The 
experimental data are from Anderson et al. (Communication at the Hambourg Conference, 1977.) 


688 QUANTUM FIELD THEORY 


expansion of the product of weak currents (V, — A,)'*'?(x)(Y— Ay)'~ 7(0) in 
terms of the twist two operators: 


(a) a ‘Wy, Dy," -D,,(1 Se ys)W 

- a a 
(b) pW Dy Al Ys) ay (13-184) 
(c) OG 7ie, Die Dela: 


A third structure function {3 = vW3/m [see Eq. (13-106)] is also involved. By 
combining amplitudes for neutrinos and antineutrinos on proton and neutron 
targets, it is possible to isolate a particular channel with given quantum numbers. 
For instance, only the octet operator (13-1845) enters the difference W' — W™. 

The sum rules arising from hadronic symmetries commuting with the gauge 
group continue to be satisfied. Thus Adler’s sum rule (11-105) is verified for all q*: 


| ia (vW' — yW,"?)) = 2m (13-185) 
i 08: 

Others are only asymptotic and approached logarithmically. An example is the 
Callan-Gross sum rule which reads 


| (deo/w® *1) Wi (co, q?) 
1 


eo o(+3) (13-186) 
| ee ie) 
1 


where the quantity in the numerator W, = (2/w)W,; — vW2/m? vanishes in the 
parton model [Eq. (13-109)]. 
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Applications to quantum electrodynamics were discussed by M. Baker and 
K. Johnson, Phys. Rev., vol. 183, p. 1292, 1969, who computed the third-order 
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positivity properties of the anomalous dimensions. 

The asymptotic freedom theorem is due to S. Coleman and D. J. Gross, 
Phys. Rev. Lett., vol. 31, p. 851, 1973. 
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Rev., vol. 185. p. 1975, 1969. and S. D. Drell and T. M. Yan, Ann. of Phys. (New York), 
vol. 66, p. 578, 1971. The relation of the phenomenological model with light-cone 
expansions is reviewed by Y. Frishman in the “Proceedings of the Sixteenth 
International Conference on High Energy Physics, Batavia,” vol. 4, edited by 
J. D. Jackson and A. Roberts, National Accelerator Laboratory, Batavia, IIl., 
1972. The subject is covered in detail by R. P. Feynman in “Photon-Hadron 
Interactions,” Benjamin, Reading, Mass., 1972. 
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179, p. 1499, 1969, and Phys. Rev., ser. D, vol. 3, p. 1818, 1971. A detailed investi- 
gation was carried out by W. Zimmermann, Ann. of Phys. (New York), vol. 77, 
pp. 536 and 570, 1973. 

Applications to selection rules in weak interactions are found in S. Weinberg, 
PS. Ree bem vel. 3), p. 494,1973, Phys. Rev., ser. D, vol. 8, p. 4482, 1974, 
and Rev. Mod. Phys., vol. 46, p. 255, 1974. For the AJ = 4 rule in nonleptonic 
decays see M. K. Gaillard and B. W. Lee, Phys. Rev. Lett., vol. 33, p. 108, 1974, 
and G. Altarelli and L. Maiani, Phys. Lett., ser. B, vol. 52, p. 351, 1974. 
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Rev., ser. D, vol. 9, p. 920, 1974. 
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M. S. Green, Academic Press, New York, 1977, and in the textbook by D. Amit, 
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Hill, New York, 1978. 


APPENDIX 


A-1 METRIC 


Metric tensor: 


1 0 0 0 

Paice tos) 6 
Guw=J = 0 0 -1 0 (A-1) 

0 0 0 -i1 


Derivatives with respect to contravariant (x“) or covariant (x,) coordinates are sometimes abbreviated 
as 

@ 6 
= o¢ = — 


RE OXp 


(A-2) 


Summation over repeated Lorentz (Greek) or space (Latin) indices is understood unless explicitly 
stated: 
VW = VW" = V4*W, = gwV*W" = g'V,W, = VW -V WH Vowo-—Viw' — (A-3) 


A boldface letter denotes a three-vector or the three-dimensional part of a contravariant four- 
vector: 


V={Vii=1,2,3} = {4 V,, hu} (A-4) 


The only exception concerns the three-dimensional gradient 


a ; 
V = {V,, V,, Vz} = (- =0;)=(-d= — Ed (A-5) 
ox' Ox; 


The d’alembertian operator is 
D = 0%0, = 05 — V’ (A-6) 


and the four-momentum operator reads 


p" = id" = {i0°, ~iV} (A-7) 
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Totally antisymmetric Levi-Civita tensor: 


+1 if {u, v, p, 0} is an even permutation of {0, 1, 2, 3} 


give? =< — | if it is an odd permutation (A-8) 
0 otherwise 
Buvpe = — ENP? (A-9) 


Useful identities: 


ets gu'v'n'a’ _ _ det (g**’) = U,V, p,o 
e= i, Vv, pi, 01 
eller vee’ — — det (g™’) a=, p,0 
ef SM, (De Cay) 
ghee po" es — (gre ae fe. g?* 9”) 
aad = 69°" 
Beyer e vp = — 24 
Three-dimensional antisymmetric tensor: 
k= ey, = 1 if (i, j, k) is an even permutation of (1, 2, 3) 
A-2 DIRAC MATRICES AND SPINORS 
The y matrices satisfy 
Wee =o aay eg (A-11) 
with y° hermitian, y' antihermitian, and are related to the 8 and « matrices through 
yp=B y= pa (A-12) 
=P =ip yyy = Hs SuvpeY yyy 
= —iVoy1)2y3 = ip*y*y'y® = yh (A-13) 
y=! 
(A-14) 
tis 7} = 0 
Commutator of y matrices: 
a 5 ieee | (A-15) 
=o = 10" 
[ys,o""]=0 (A-16) 
te £ suroeg 
2 
ysy’'y== where Y= deino% (A-17) 
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Hermitian conjugates: 


= 


p°ysy° = —y$ = — Ys 
V(ysy")y? = (ysy") 


yam")? = (aM 


(A-18) 


For any two spinors W; and W2 and any 4 x 4 matrix I, 


(iP W2)* = Walyol yo (A-19) 


while the corresponding identity for wo anticommutating spin 3 fields involves an cxtra minus sign. 


Charge conjugation matrix: 


CyCo! = — yf 
CysC~} = yf (A-20) 
Co,,C~' ==), 


C(ysy,)C~* = (ysy,)? 


il, ,) a a Fe, ') (A-21) 
1 0 nl Ql 


Pauli matrices: 


Dirac representation: 


-o 0 
ae i 
oh sao @! 
0 -!I 
549 — _jg27@J= (A-22) 
ogee 


, pe ob!) 
oF = tind @ of = &4y,.2* = oul 5 >) 


: 0 —io? 
C= ity? = —io! Bo? = ( 5 ) 


—io 


Cac ==C meee ect C= —-I (A-23) 
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Majorana representation: 


0 


0 oa! 
w= -o'@o'=( 9, 4 


It 


0 —-o 
oe -o'@0=(_", : ) 


oe io? 0 
0 io? 
) =e 
Yo eth 2 
io” ®o e 0 ) 


f Sita 0 
pe-uoee(-i °,) 


<2 
Il 


~2 
i 


Ui 


0 —io 
e 0 
pero @ot=(4 = 
0 —ic? : 
C=—-ia' @ar=| . also satisfies (A-23) 
—io? 0 


Relation with the Dirac representation: 


1 t er a Gees 
Ueaede = UVB racU with U = Ut =— 


Wp 


Chiral representation: 


; —io? 0 
= —io? @o7 = ( " satisfies (A-23) 


(A-24) 


(A-25) 
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Relation with the Dirac representation: 


F iu le —f 
Vehical me Urb iracU" with OS = ¥s)0) = = ( vs 
/2 a2 
_ Contraction identities: 
49 = ab — ic,,a"b” 
Yn=d 
yyy, = —2y" 
yyy’ya = 4g” 


yyy yay, = = 276 tt 
Pryy yy pa = 2yyhyy? + yPy yy’) 
yoy, = 0 
yoty?y, = 2y’o™ 
Traces: 
trl=4 
try*=0 (A-28) 
try>=0 
The trace of an odd product of y” matrices vanishes: 
tr (y°y") = 0 
try") = 49" 
tr (o””) = 0 . (A-29) 


tr (y"y"y?) = 0 
tr (yMy"y?y?) = 41g gh? — gh’gh® + gh") 
te (y°yty"y?y") = — ict??? = 4ibuyp 
tr (did2°** dan) = tt (dan*** dad) 
tr (1 °°" dan) = @1* 2 tr (A3*** fon) — G1 * 3 tr dogg?’ don) + 77° + Gyan th (d2"**fan-1) — (A-30) 
= 42 8(a;,° a;,)°**(a;,* @;,) 
eis the signature of the permutation i,/,°:'i,j,, and the sum runs over the (2n)!/2"n! different pairings 
satisfying 1 = i, < ig <°°' < in, iy < jg. 
Dirac spinors u and v solutions of the Dirac equation 
 — m)u(p) = 0 
(6 + mvp) = 0 
are functions of the on-shell momentum p, with p° = E, = /m + p? and are labeled by a polarization 


index a = 1, 2. 
Conjugate spinors: 


(A-31) 


a=uly? p= vty? (A-32) 
u'(p)(p — m) = 0 
5 (p)(B + m) = 0 


(A-33) 
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Normalization: 
ia’(p)u'(p) = 6%” 
5"(pjo(p) = — 3°? (A-34) 
d(p)u(p) = ai (p)v(p) = 0 

Density: 


5 
it (p)y9ulP(p) = ut p)ul(p) = Gulp) = =? 5? 


le 
B(p)y°v(p) = vt (p)v'(p) = — 5 (p)v(p) = — 5% (A-35) 
m 
Pp se (p°, ok P) 
Projection operators over the positive and negative energy states: 
+m e 
As(p) = ASL > up) @ u(p) 
2m a=1,2 
(A-36) 
m _ 
Ae — (2) (2) 
(p) = pe v(p) @ vp) 


Projectors over a definite polarization state along a space-like four-vector n orthogonal to p, 
n-p=0: 


+mi+ 
ulp,n) @ up, n) = 2 ™ * Ist 
(A-37) 
—0(p, n) ® slp, n) = "—* * tse 
For comments on helicity states, see Sec. 2-2-1. 
Gordon identities: 
1 
mi (p)yu(g) = = a ()[(p + a) + io(p — ghy]u'(q) 
' (A-38) 
Hi (p) yy? ul (q) = = wp) Llp — alty® + io(p + gvy>]u(Q) 
In particular: 
w"(p)quiP(p) = 59 28 
m 
(A-39) 


ut (pap) = 6% oe 
m 


A-3 NORMALIZATION OF STATES, § MATRIX, UNITARITY, 
AND CROSS SECTIONS 


Normalization of one-boson states: 
<p| p'> = 2w,(2n3)53(p — p’) (A-40) 


with w, = Re p? + m? and polarization indices omitted. 
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One-fermion states: 
@ 
<p|p> = ss (27)*5°(p — p’) (A-41) 


(For massless fermions such as neutrinos, it is safer to use a normalization of the form (A-40) in 
intermediate computations). 
S matrix and invariant scattering amplitude: 


S=1+iT 
A-42) 
<f| T|i> =e (2n)*5*(P, <= P)F ( 
Differential cross section for the scattering from an initial state i = {1,2} involving no massive fermion 
into a final state f = {3,4,....n): 
1 [Fs]? 
do = ———__________'_<"'_ 45, ---dp,(2n)*6“(P; — P A-43 
Als pay — mindy gs 6P3 APalnV OUP — Ps) A) 
The factor S is 
S=[]k;! (A-44) 
if there are k; identical particles of species i in the final state. The measure dp generally denotes 
d*p 
dp= A-45 
P= Qn)?20, le 
except for massive fermions for which 
. dpm 
dp = 7 (A-45b) 
(2x) w, 


Accordingly, if the incident particles | and/or 2 are massive fermions, the expression (A-43) has to be 
multiplied by 2m, and/or 2m. 

The formula (A-44) may have to be supplemented by an average over the initial polarizations 
and a summation over the final ones. 

The decay rate dt = d(t~*) of a particle of mass M into particles 3,4,...,n is given in its rest 
frame by the right-hand side of Eq. (A-43) with the flux factor 1/4[(p1* p2)? — mim3]'/? replaced by 
1/2M. The same modifications as above are to be brought when fermions are present. 

Differential cross section for two-body scattering 1 + 2 + 3 + 4 of nonidentical particles: 


do 
bs —= F(s. t)\? A-46a 
de Tae | 7(s, t)| ( ) 
d t 
or ak |F(s,)/? (A-46b) 


AQem 14] 642s 


in terms of the Mandelstam variables: s = (p, + p2)*, t = (py — p3)*, and of the initial and final center 
of mass momenta 


2 _ Ms, mi,m2) _ [s — (m1 + m2)" ]['s — (rm — m2)" 


s s 


4q 
7 (A-47) 
y] 2 
4q/2 = As, m3, m4) 
s 
Optical theorem: total cross section i—-:- in terms of the imaginary part of the forward elastic 


amplitude 7;,(s, t = 0): 
Im 7;;(s, t = 0) 


ae =”) (A-48) 
A1/2(s, m2, m3) 


Fro (i) = 
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Decomposition into partial wave amplitudes for spinless particles: 
Ts(s, t) = 16x ¥ (21 + 1)Pi(cos 6)F ‘i(s) (A-49) 
I 


with 5 
(m? — m3)(m3 — mi) 


SY 


4|q||q'|cos@=t—ut 
stt+u=mi+m3+m3+ mi 
Unitarity below the inelastic threshold results:in 


sil? 
a 2]q| e'4 sin 6,(s) (A-50) 


Generalization to particles with spin has been sketched in Chap. 5. 


A-4 FEYNMAN RULES 


Feynman rules for the computation of a definite Green function or scattering amplitude: 


1. Draw all possible topologically distinct diagrams—connected or disconnected but without vacuum- 
vacuum subdiagrams—contributing to the process under study, at the desired order. 
2. For each diagram, and to each internal line, attach a propagator: 


k ‘ | 
o——_-e === for a spin 0 boson (A-5S1) 
P || for a spin 3 fermion (A-52) 
a ae —m-+ it] pa ‘ 
k .£ Ipe — k pke/u? Kphelie 
p o ( k?—p?+ie =k? — p?/A +4 ie Aa 
. Gp (le A~")kpke | 
eS a eee A-53 
lp —p? tie (k? — p? + ie)(k? — p?/A + ie) ine) 


for a spin 1 boson of mass i in the Stueckelberg gauge, 1.e., endowed with a kinetic lagrangian 


“ 2 
ane eer ee ; (0+ A)? + oa 
3. To each vertex, assign a weight derived from the relevant monomial of the interaction lagrangian. 
It is composed of a factor coming from the degeneracy of identical particles in the vertex, of the 
coupling constant appearing in i#;,,, of possible tensors in internal indices, and of a momentum 
conservation delta function (22)*6*(Zp). To each field derivative @,¢ is associated —ip, where p 
is the corresponding incoming momentum. Vertices for the most common theories are listed below. 
4. Carry out the integration over all internal momenta with the measure d*k/(2z)*, possibly after a 
regularization. 
5. Multiply the contribution of each diagram by 
(a) a symmetry factor 1/S where S is the order of the permutation group of the internal lines 
and vertices leaving the diagram unchanged when the external lines are fixed; 
(b) a factor minus one for each fermion loop; and 


(c) a global sign for the external fermion lines, coming from their permutation as compared to the 
arguments of the Green function at hand (see Chap. 6). 
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These rules yield truncated functions with no factor on the external lines. Connected functions 
(2n)*5*(Zp)G.(py,-.., pn) are obtained by retaining only connected diagrams and by putting propagators 
(A-51) to (A-53) on the external lines. Contributions to proper Green functions i(27)*5*(Zp)T'(p,,..., Pa) 
come from one-particle irreducible diagrams. Finally, the scattering amplitude i7 (22)*5*(P; — Py) 
is obtained, up to renormalization, from the previous rules by putting the external lines on their 
mass shell, i.e., letting p? = m7, and providing external fermion lines with spinors u(p), v(q’'), ap’), 
b(q) according to whether the line enters or leaves the diagram and whether it belongs to the initial 
or final state (p = p’ = m, g = q' = —m). 


u(p) 


Initial state Final state 


Standard theories 
(a) @* theory 


Propagator (A-51) 
Vertex —id(2n)*6*(Zp) 


(b) Quantum electrodynamics 
A - 
= —4(0,A, — 0,A,)° — he - A)’ + Wid — eA — mp 


Photon propagator (A-53b) with »? = 0 
Fermion propagator (A-S2) 


m 


Vertex ne — ie(y,)pa(2x)*6*(Ep) 


a 86 


(c) Scalar electrodynamics 
a A, — 2 (@- A)? + [0, + eA, )@}'[0" + ieAX\o] — mot — 2 (poy? 
Lf = —4(0,A, — 0, BH aah [( “ le w)P 4 


Photon propagator (A-53) with uy? = 0 
Scalar propagator (A-51) oriented along the charge flow 
Vertices: 


—ie(py + p,)(2n)*5*(p — p’ — k) 
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Vector propagator as in (A-53b), with n* = 0 
Ghost 4 propagator as in (A-51) 
A minus sign for each ghost loop 
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Fermion-vector vertex : 
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Scalar-vector vertices: 
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Adier compatibility condition, 538 
Adler sum rule, 533, 688 
Adler-Weisberger sum rule, 540 
Analyticity properties, 245, 301 
Angular momentum: 

scalar field, 26, 117 

Dirac field, 52, 144 
Annihilation operators, 109, 114, 146 
Anomalies, 549, 605, 628 
Anomalous dimensions, 650, 656, 678, 684 
Anticommutation relations, 146 
Antilinear, antiunitary operator, 46 
Asymptotic freedom, 657 
Asymptotic (ultraviolet) behavior, 405, 654 
Axial gauge, 566, 584 


Bare parameters, 346 

Bargmann-Fock states (see Coherent states) 
Bargmann-Michel-Telegdi equation, 17 
Becchi-Rouet-Stora transformation, 595 
Bender-Wu formula, 473 

f-function, 639, 649, 653 

Bethe-Heitler cross section, 240 
Bethe-Salpeter equation, 480, 482 
Bhabha cross section, 281 

Bjorken’s inequality, 534 
Bjorken-Feynman scaling, 663 
Bloch-Nordsieck, 197 
Bogoliubov-Parasiuk-Hepp theorem, 397 
Bogoliubov recursion formula, 391 

Bohr radius, 72 

Borel transformation, 465 

Breit equation, 498 

Bremsstrahlung, 39, 238, 286, 352 
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Cabibbo angle, 527, 630 

Cabibbo-Radicati sum rule, 532 

Callan-Gross sum rule, 688 

Callan-Symanzik equations, 637, 649 

Casimir effect, 138 

Casimir operators, 592 

Charge conjugation, 85, 122, 152 

Charge renormalization, 346, 414 

Charged scalar field, 30, 120, 282 

Charm, 517, 628 

Chiral symmetry, 534, 541 

Chirality, 87 

Classical electromagnetic radius, 38 

Clifford algebra, 54 

Cluster property, 512 

Coherent states, 118, 170, 435 

Coleman theorem, 513 

Coleman-Weinberg effective potential, 454 

Collective modes, 470 

Color quantum number, 518, 553, 629, 659 

Compton scattering, 39, 224, 286, 536 

Confinement, 517, 659 

Conformal invariance, 642 

Connected Green functions, 212, 270 

Connected § matrix, 211 

Conservation laws, 19, 28, 509 

Conserved vector current (CVC) hypothesis, 527 

Constrained systems, 456, 573 

Convergence theorem for Feynman amplitudes, 
382 

Coulomb gauge, 10, 576 

Coulomb scattering, 94, 349 

Counterierms, 326, 385 

Counting of diagrams, 466 
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Covariant derivative, 31, 564 

CP violation, 160, 630 

Creation operators, 109, 114, 146 
Critical dimension (lower), 526 
Cross section, 201, 242, 245 
Crossing symmetry, 228, 231, 248 
Crossing matrix, 609 

Current algebra, 526 

Cutkosky rules, 315, 367 


Darwin term, 71 

Decay rate, 234 

Deep inelastic scattering (see Lepton-hadron 
scattering) 

Dilatation current, 645 

Dimensional regularization (see Regularization) 

Dirac equation, 48 

Dirac matrices, 49 

Dirac field (quantization of), 142 

Dispersion relations, 250 

Drell-Hearn sum rule, 537 

Dyson-Schwinger equations of motion, 447, 475 


Edge of the wedge theorem, 248 
Effective action, 294, 448 

in g* theory, 455 

in nonabelian gauge theories, 585 
Effective charge, effective coupling, 633, 671, 686 
Effective potential, 451 - 
Eikonal approximation, 434 
Electromagnetic field, 7 

classical motion in a constant, 14 

Dirac particle in a constant, 67 

Dirac propagator in a constant, 93, 100 

interacting with a classical source, 163 

quantization of the, 127, 461 
Electron-electron scattering, 276 
Electron-neutrino scattering, 624 
Electron-positron annihilation, 668 
Electron-positron scattering, 280 
Electroproduction, 661 
Energy-momentum tensor: 

classical, 22, 569 

for quantized fields, 117, 143 
Equivalence theorem, 448 
Euclidean Green functions, 126, 299 
Euler-Heisenberg lagrangian, 195 
Exceptional momenta, 402 


Faddeev-Popov ghosts, 582 
Fermi constant (G), 526 
Feynman diagrams, Feynman amplitudes, 264, 
287 
counting of, 466 
paramagnetic representation of, 294 
Feynman gauge, 134, 582 
Feynman-Kac formula, 467, 571 
Feynman propagator: 
of the electromagnetic field, 36, 133 
of the Klein-Gordon field, 35, 124 


Feynman propagator (continued) 
of the Dirac field, 91, 150 
of the massive vector field, 137 
Feynman rules, 265 
for quantum electrodynamics, 271 
for scalar electrodynamics, 285 
for nonabelian gauge theories, 582 
Fierz theorem, 160 
Fine structure, 75 
Fine structure constant (a), 38 
Fixed point (ultraviolet), 637, 641, 655 
Flavor quantum number, 514, 630 
Fock space, 112, 129, 146 
Foldy-Wouthuysen transformation, 69 
Forest of subdiagrams, 392 
Form factors, 159, 532 
to the one-loop order, 340 
Fredholm determinant, 187, 450, 471 
Froissart bound, 258 
Functional integrals, 425 
Furry theorem, 276 


Gauge: 
dependence of Green functions, 604, 651 
independence of the S matrix, 619 
transformations, 10, 31, 563, 565 
Gell-Mann low function, 633 
Gell-Mann matrices, 516 
Gell-Mann Nishijima formula, 515 
Gell-Mann Okubo relations, 519 
Generating functional : 
of S matrix elements, 208 
of connected Green functions, 212, 445 
of proper Green functions, 290 
Georgi-Glashow model, 616, 623 
Glashow-Iliopoulos-Maiani mechanism, 628 
Goldberger-Treiman relation, 535 
Goldstone bosons, 519, 549, 612 
Gordon identity, 60 
Grassmann anticommuting algebra, 182, 439 
Green functions: 
connected, 212, 270 
euclidean, 126, 299 
Feynman’s (see Feynman’s propagator) 
general, 211, 261 
generating functional of, 211, 445 
one-particle irreducible, 289 
proper, 289 
retarded, 33, 89 
truncated, 289 
Gribov ambiguities, 576 
Gupta-Bleuler quantization, 127 
Gyromagnetic ratio (g) (see Magnetic moment) 


Haffnian, 182 

Hard breaking, hard insertions, 547, 679 
Helicity, 59, 87, 148, 243 

Higgs boson, 614 

Hole theory, 84 

Hydrogen-like atoms, 72 


Hypercharge, 513 
Hyperfine structure, 79 
of positronium, 493 


Indefinite metric, 127 

Induced emission and absorption, 174 

In- and out- field and states, 165, 202 
Infinite momentum frame, 531, 664 
Infrared divergences, 40, 170, 241, 352, 404 
Incidence matrix, 294 

Instantons, 572 

Interaction representation, 177, 261 
Isospin symmetry, 252, 514 


Jacob-Wick helicity formalism, 243 
Jost-Lehmann-Dyson representation, 249 


Kallen-Lehmann representation, 203 
Karplus-Klein formula, 507 
Kinoshita theorem, 404 
Klein-Gordon equation, 32, 47, 108 
Klein-Nishina cross section, 229 
Klein paradox, 62 


Lamb shift, 80, 328, 358 

Landau equations, 302 

Landau gauge, 134, 582 

Landau ghost, 636 

Landau levels, 67 

Larmor’s formula, 37 

Large orders in perturbation theory, 463 

Lee-Nauenberg theorem, 404 

Legendre transformation, 2, 290 

Lehmann ellipses, 257 

Lehmann-Symanzik-Zimmermann 
theory, 202 

Lepton-hadron inclusive scattering, 533, 661 

polarization effects in, 668 
in nonabelian gauge theories, 685 

Lienard-Wiechert potential, 36 

Light-cone dynamics, 664 

Light-cone expansion, 681 

Loop-wise expansion, 287 

Lorentz-Dirac equation, 42 

Lorentz gauge, 12, 128, 567 

Lorentz transformations, 5 

Low energy theorems, 536, 549 


asymptotic 


Magnetic moment, 15, 66 
anomalous, 347, 626 
Majorama representation, 49 
Mandelstam representation, 313 
‘ Mandelstam variables, 242 
Mass insertion, 647, 656, 659 
Mass renormalization, 332 
Massless Dirac particle, 87 
Massless theories, 402 
Massive vector field, 135 
Maxwell equations, 7 
Mermin-Wagner theorem, 525 
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Minimal coupling, 31, 64, 567 
Mller cross section, 279 
Monopole (’t Hooft-Polyakov), 616 
Mott cross section, 98, 279 


Neutral current (weak), 623, 627 
Noether’s theorem, 23, 29 
Nonabelian gauge theories, 562 
Nonleptonic interactions, 680 
Nonlinear o-model, 549 
Normal kernel, 437 

Normal ordering, 111 
Normalization conditions, 384 


Octet dominance, 518 

Octet model, 513 

One-particle irreducible diagrams and Green 

functions, 289 

Operator product expansion, 672 

Operators (composite): 
anomalous dimensions of, 651 
hard, soft, 547, 679 
renormalization of, 399, 684 

Optical theorem, 241 

Overlapping divergences, 392, 415 


Pair annihilation, 230, 286 
Pair creation in an external field, 193 
Parametric representations of Feynman ampli- 
tudes, 294 
Parity, 53, 151, 244 
Partial conservation of the axial current (PCAC), 
SBh), S55 

Partial wave expansion, 243 
Parton model, 663 
Path integrals, 425 
Pauli equation, 65 
Pauli-Lubanski vector, 27, 52, 147 
PCT theorem, 157 
Pfaffian, 185 
Phase shift, 245 
Phase space measure: 

for bosons, 114 

for fermions, 147 
Phenomenological lagrangians, 549 
Photon-photon scattering, 355 
Photon mass, 138 
Pion decay constant (f,), 534 
Pion decay: 

charged, 528 

neutral, 550 
Pion-hadron scattering lengths, 539 
Pion-nucleon coupling, 255, 535 
Pion-nucleon scattering, 245, 252, 538 
Plane wave: 

classical particle in a, 18, 37 

Dirac particle in a, 68, 102 
Poincaré group, 5, 30, 52, 117 
Poisson bracket, 7, 457 
Pomeranchuk theorem, 259 
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Pontryagin index, 571 
Positronium: 
decay, 153, 233 
excited spectrum of, 493, 508 
hyperfine splitting of, 493 
Power counting, 344, 379 
Poynting vector, 25 
Proca equation, 135 
Propagator: 
complete, 289 
covariant, for vector fields, 220 
electron, 329 
(see also Green function and Feynman’s 
propagator) 
Proper diagrams and Green functions, 289 
Proper time method, 100 


Quantization: 
around a stationary configuration, 469 
canonical, 105 
of constrained systems, 456 
of nonabelian gauge theories, 574 
Quarks, 517, 553, 627, 663 


Radiation of a classical charged particle, 36 
Radiative corrections in quantum  electro- 
dynamics, 347 

Reduction formulas, 205, 208, 213, aul 
Regularization, 374 

dimensional, 375, 415, 589 

Pauli-Villars, 319, 375, 411, 555 
Renormalizable and nonrenormalizable theories, 

381 

Renormalization: 

finite, 397 

general theory of, 385 

minimal, 416 

multiplicative, 346, 389 

of composite operators, 399, 684 

of nonabelian gauge theories, 585, 594, 617 

of quantum electrodynamics, 318, 405 

of the o-model, 543 
Renormalization group, 399, 650 
Renormalization part, 390 
Representations of SU(2), 243 
Representations of SU(3), 515 
Rutherford formula, 96 
Rydberg constant (Ryd), 72, 364, 495 


Scalar electrodynamics, 282, 523 

Scale invariance, 641, 654 

Scaling, 663 

Schwinger’s action principle, 429 
Schwinger model, 560, 620 

Schwinger terms, 224, 530 

Seagull terms, 285, 531 

Self energy (self-mass), 291, 329 
Short-distance expansion, 666, 671, 672 
o-model, 540 

Singularities of Feynman integrals, 302 


Slavnov-Taylor identities, 594 
S matrix, 166, 176, 198, 204, 209, 445 
Soft insertion, 547, 679 
Soft photons (see Infrared divergences) 
Soft pion theorems, 538 
Solitons, 569, 616 
Spin-orbit interaction, 16, 71 
Spin-statistics theorem, 149 
Spontaneous symmetry breaking, 519, 534, 548, 
612 
Strangeness, 515 
Strength tensor, 565 
Structure constants, 28, 511 
Structure functions, 532, 662, 665 
moments of, 684 
Stueckelberg’s lagrangian, 136 
SU(3) symmetry, 515 
Superficial degree of divergence, 344, 380 
Super-renormalizable theories, 381 
Symmetries: 
and conservation laws, 19 
discrete, 151 
in quantum field theory, 509 
internal, 28 
spontaneously broken, 519, 534, 548, 612 
Symmetry factors, 265, 268 


Tadpole diagram, 271, 542 

Thomas precession, 15 

Thomson scattering, 39 

*t Hooft gauge, 619 

Threshold (normal), 307 
anomalous, 308, 311 
pseudo, 309 

Time-ordered product, 123, 150, 177 

Time reversal, 154, 244 

Tree diagram, 294 

Triality, 515 

Truncated diagrams and Green functions, 289 

Twist, 682 


Uehling’s potential, 327 

Ultraviolet behavior, 405, 632 
Ultraviolet divergences, 329, 343, 372 
Unitarity of the S matrix, 241, 619 
Unitary gauge, 617, 621 

Unitary symmetry (see Octet model) 


V-A coupling, 527 
Vacuum: 
degeneracy of the, 512, 571 
invariance or non invariance of the, 512, 519 
Vacuum diagrams, 266 
Vacuum fluctuations, 119, 138 
Vacuum polarization, 319, 415, 633 
Van der Waals forces, 141, 365 
Vertex function, 335 


Ward-Takahaski identities, 337, 407, 531, 547, 
594, 645 


Wave-function renormalization, 202, 334, 346 
Weak interactions, 526, 606, 620, 680 
Weight diagrams, 515 

Weinberg angle, 623 
Weinberg-Salam model, 620 
Weinberg theorem 405 

Welton’s argument, 80 

Weyl equations, 88 

Wick-Cutkosky model, 487 

Wick ordering, 111 

Wick rotation, 300, 485 

Wick theorem, 180, 447 

Wigner 2-functions, 243 
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Wigner’s theorem, 46, 151, 511 
Wilson’s criterion, 660 
Wilson expansion, 672 
Winding number, 570 


Yang-Mills field, 562 
massive, 606 
Yukawa coupling, 629, 658 


Zero mass limit, 402 

Zero point energy, 110, 139 
Zimmermann’s subtraction formula, 392 
Zitterbewegung, 62 
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QUANTUM 
FIELD THEORY 


Claude Itzykson and Jean-Bernard Zuber 


uantum field theory remains among the most important tools in 
Cee and explaining the microscopic world. Recent years have 
witnessed a blossoming of developments and applications that extend 
far beyond the theory’s original scope. This comprehensive text offers a 
balanced treatment, providing students with both a formal presentation 
and numerous practical examples of calculations. 


This two-part approach begins with the standard quantization of electro- 
dynamics, culminating in the perturbative renormalization. The second part 
comprises functional methods, relativistic bound states, broken symmetries, 
nonabelian gauge fields, and asymptotic behavior. Appropriate for students 
and researchers in field theory, particle physics, and related areas, this 

- treatment presupposes a background in quantum mechanics, electro- 
dynamics, and relativity, and it assumes some familiarity with classical 
calculus, including group theory and complex analysis. 
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